Intel VTune Profiler
Intel Corporation
Use advanced sampling and profiling methods to quickly analyze code, isolate issues, and deliver performance insight on modern CPUs, GPUs, and FPGAs.
Vendor
Intel Corporation
Product details
Performance Analysis for Applications & Systems
Intel® VTune™ Profiler optimizes application performance, system performance, and system configuration for AI, HPC, cloud, IoT, media, storage, and more.
- CPU, GPU, and NPU: Tune the entire application's performance, not just the accelerated portion.
- Multilingual: Profile code written in SYCL*, C, C++, C#, Fortran, OpenCL™, Python*, Go*, Java*, .NET, assembly, or any combination of these languages.
- System or Application: Get coarse-grained system data for an extended period or detailed results mapped to source code.
- Power: Optimize performance while avoiding power- and thermal-related throttling.
Features
- Algorithm Optimization
  - Locate hot spots, the most time-consuming parts of your code.
  - Visualize hot code paths and the time spent in each function and its callees with Flame Graph.
- Microarchitecture and Memory Bottlenecks
  - Identify the most significant hardware issues that affect the performance of your application with microarchitecture exploration analysis.
  - Pinpoint memory-access-related issues such as cache misses and high-bandwidth problems.
- Accelerators and XPUs
  - Optimize GPU offload schema and data transfers for SYCL, OpenCL, Microsoft DirectX*, or OpenMP* offload code. Identify the most time-consuming GPU kernels for further optimization.
  - Analyze GPU-bound code for performance bottlenecks caused by microarchitectural constraints or inefficient kernel algorithms.
  - Understand how much data is transferred between a neural processing unit (NPU) and DDR memory, and identify the most time-consuming tasks running on the NPU.
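To make the hot-spot and Flame Graph ideas under Algorithm Optimization concrete, here is a minimal Python sketch of how sampled call stacks can be reduced to per-function "self" time (the function was executing) and "total" time (the function or any of its callees was executing). It is an illustration of the general sampling technique only, not VTune Profiler's implementation or API; the function name `aggregate_samples` and the sample data are hypothetical.

```python
from collections import Counter


def aggregate_samples(stacks):
    """Return (self_counts, total_counts) from sampled call stacks.

    Each stack is a tuple of function names ordered from the outermost
    caller to the function that was executing when the sample was taken.
    """
    self_counts = Counter()
    total_counts = Counter()
    for stack in stacks:
        if not stack:
            continue
        # The last frame was on the CPU, so it receives the self time.
        self_counts[stack[-1]] += 1
        # Every distinct function on the stack receives total time once,
        # so a recursive function is not double-counted for one sample.
        for fn in set(stack):
            total_counts[fn] += 1
    return self_counts, total_counts


# Hypothetical samples, invented for illustration (not profiler output).
SAMPLES = [
    ("main", "solve", "dot"),
    ("main", "solve", "dot"),
    ("main", "solve"),
    ("main", "load"),
]

self_counts, total_counts = aggregate_samples(SAMPLES)

# The hot spot is the function with the most self samples.
hottest = self_counts.most_common(1)[0][0]

for fn, total in total_counts.most_common():
    print(f"{fn}: self={self_counts[fn]} total={total}")
```

In a flame graph, the width of a frame corresponds to the total count and the part of it not covered by child frames corresponds to the self count, which is why a wide frame with narrow children is a candidate for optimization.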