Logo
Sign in
Product Logo
NVIDIA Transformer EngineNVIDIA

NVIDIA Transformer Engine accelerates Transformer models on NVIDIA GPUs, using FP8 precision for better performance and lower memory utilization.

Vendor

Vendor

NVIDIA

Company Website

Company Website

Product details

NVIDIA® Transformer Engine is a library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs. This provides better performance with lower memory utilization in both training and inference.

Features

  • FP8 Precision: Utilizes 8-bit floating point (FP8) precision for improved performance and reduced memory usage.
  • GPU Optimization: Optimized for NVIDIA Hopper and Ada GPUs.
  • Training and Inference: Enhances both training and inference processes.
  • Comprehensive Documentation: Includes user guides, release notes, and software license agreements.

Benefits

  • Enhanced Performance: Achieves better performance with lower memory utilization.
  • Efficient Resource Usage: Reduces memory usage, making it more efficient.
  • Versatile Application: Suitable for both training and inference of Transformer models.
  • Detailed Documentation: Provides comprehensive documentation for ease of use.
Find more products by category
Development SoftwareView all