Radeon Compute Profiler (RCP)


The Radeon Compute Profiler (RCP) is a performance analysis tool that gathers data from the API run-time and GPU for OpenCL™ and ROCm/HSA applications. This information can be used by developers to discover bottlenecks in the application and to find ways to optimize the application’s performance.

RCP was formerly delivered as part of CodeXL with the executable name “CodeXLGpuProfiler”. Prior to its inclusion in CodeXL, it was known as “sprofile” and was part of the AMD APP Profiler product.

The Radeon Compute profiler is available for both Microsoft Windows® and Linux®, and can be downloaded from here.

The source code can be found here.

Key Features

  • Measure the execution time of an OpenCL™ or ROCm/HSA kernel.
  • Query the hardware performance counters on an AMD Radeon graphics card.
  • Use the CXLActivityLogger API to trace and measure the execution of segments in the program.
  • Display the IL/HSAIL and ISA (hardware disassembly) code of OpenCL™ kernels.
  • Calculate kernel occupancy information, which estimates the number of in-flight wavefronts on a compute unit as a percentage of the theoretical maximum number of wavefronts that the compute unit can support.
  • When used with CodeXL, all profiler data can be visualized in a user-friendly graphical user interface.

Technical Blogs