NVIDIA Nsight Compute

NVIDIA Nsight™ Compute is an interactive profiler for CUDA® and NVIDIA OptiX™ that provides detailed performance metrics and API debugging via a user interface and command-line tool. Users can run guided analysis and compare results with a customizable and data-driven user interface, as well as post-process and analyze results in their own workflows.

Vendor

NVIDIA

Product details

Features

  • Guided Analysis: Surfaces performance problems and suggests solutions based on NVIDIA’s best practices, with detailed metrics on GPU utilization and performance limiters.
  • Memory Workload Analysis: Visualizes memory transfer sizes and throughput, highlighting potential bottlenecks and underutilization in the memory pipeline.
  • Source Code Correlation: Correlates efficiency metrics down to individual lines of code, connecting assembly with higher-level code and highlighting areas with high metric values.
  • Interactive Profiling: Creates live sessions where application state can be viewed dynamically, allowing for performance comparisons and resource inspections.
  • Customizable and Collaborative: Can be extended with custom metric collection and analysis workflows, supports cross-platform development, and enables collaboration through report sharing.

Benefits

  • Enhanced Performance: Helps identify and remove performance bottlenecks, improving overall application efficiency.
  • Detailed Insights: Provides comprehensive insights into GPU performance, from memory workload to source code efficiency.
  • Flexibility: Supports various development environments and platforms, including desktop, mobile, cloud, and HPC.
  • Ease of Use: Offers user-friendly interfaces and integration with popular development environments.
  • Advanced Profiling: Enables detailed performance analysis and debugging for CUDA, graphics, and deep learning applications.