Meet the Radeon™ GPU Profiler, a ground-breaking low-level optimization tool that provides detailed information on Radeon™ GPUs.
| DOWNLOAD - Latest version 1.8
This release adds the following:
- Completely redesigned Radeon Developer Panel.
- Many instruction timing improvements:
- Full frame instruction timing data collection.
- Improved accuracy of timing and hardware utilization data.
- Improved support for RDNA wave32/wave64 modes.
- Improved UI handling of running at different DPI display settings.
- Support for running on Ubuntu 20.04.
- Many bug/stability/usability fixes.
Unlike the black box approach of the past, PC game developers now have unprecedented, in-depth access to a GPU and can easily analyze graphics, async compute usage, event timing, pipeline stalls, barriers, bottlenecks, and other performance inefficiencies.
This unique tool generates easy to understand visualizations of how your DirectX®12, Vulkan®, and OpenCL™ applications interact with the GPU at the hardware level. Profiling a game is both a quick and simple process using the Radeon™ Developer Panel and our public GPU driver.
Figure out your frame
Get a bird’s eye view of how your command buffers got submitted to each GPU queue.
Understand how your graphics, async compute, and copy workloads interact and synchronize.
Wade through your wavefronts
Undestand how your wavefronts were pushed through the GPU. We can also correlate between wavefronts and the GPU events which launched them.
The data displayed in this view is highly filterable, groupable, and includes a side panel with added detail about user selections.
Speed up your shaders
Quickly and easily find hotspots in your shaders using the instruction timing view.
Each instruction in your RDNA ISA has a bar showing its average latency, allowing you find the right things to optimize.
Banish those barriers!
Find out which barriers flushed caches, caused a synchronization point or even ran their own, internal shaders.
Burst those pipeline bubbles and claim back your performance.
- Radeon™ RX 5500 series and RX 5300 series
- Radeon™ RX 5700 and RX™ 5700 XT
- Radeon™ VII
- Radeon™ RX Vega
- Ryzen™ Processors with Radeon™ Vega Graphics
- Radeon™ R9 Fury, Fury X and Fury Nano
- Radeon™ RX 400 and RX 500 series
Supported graphics APIs
- DirectX® 12
Supported compute APIs
- Windows® 10
- Linux – Ubuntu 18.04.3 LTS
- Linux – Ubuntu 20.04.1 LTS
| VERSION HISTORY
- Support for Radeon™ RX 5500 and Radeon™ RX 5300 hardware.
- New Pipelines Overview pane to summarize pipeline usage for the profile.
- Pipelines and Pipeline state views will indicate if a shader was compiled using wave32 vs. wave64 on RDNA hardware.
- In the Barriers pane, additional cache levels (L0/L1/L2) are shown for invalidates on RDNA hardware.
- The Most Expensive Events and Render/Depth Targets panes now have sortable table columns.
- The Frame Summary and Profile Summary panes now show the amount of profiling overhead (the amount of video memory and bandwidth consumed by profile data collection)
- Add Overlays in the Wavefront Occupancy Event Timeline view to view User events, Hardware contexts, Command buffers and Render targets.
- Improved Instruction Timing to increase accuracy of timing data.
- Improved zoom control UI in the various panes that support zooming.
- Improved UI when running at different DPI settings.
- Bug/stability fixes.
- Add support for RX5700 and RX5700XT.
- Added instruction timing search.
- Support for displaying profiles taken with instruction tracing data
- Support for displaying user events in the Wavefront Occupancy timeline view
- Support to display GCN ISA disassembly in the Pipeline state view
- Support for showing and colorizing API PSO hash for each event
- New grouping modes based on API PSO hash
- Improved grouping of events and waves
- Additional state bucket to support API PSO hashes
- Barriers pane now has sortable columns in the table
- Version number added to title bar
- A Check For Updates feature has been added to alert users when a new version of the tool is available
- Support to display OpenCL profiles
- Fix “Stall due to context rolls” table entry in the Events table on the Context rolls pane to toggle
time units correctly
- Enable search string to search by user marker if selecting “color by user marker” in the Event timing
- Display an error dialog box if the profile being loaded exceeds the maximum number of events supported by RGP
- Fix duplicate user event strings being displayed in the events side panel
- Fix bug in the event-to-bucket lookup when grouping by state bucket in the Event timing pane
- Bug/stability fixes & UI clean ups
- Replace the pie charts in the side details pane with a bar graph
- API Shader resource usage table added to the single event side panel, showing
register and LDS usage along with theoretical wavefront occupancy
- Addition of a pie chart showing the number of queue submissions on the frame summary pane
- Addition of a render/depth targets overview pane
- Bug/stability fixes & UI clean ups
- Adds interop between RGP and RenderDoc. An RGP profile can be taken from RenderDoc and events can be selected in one tool and displayed in the other. To use the interop features, install the latest driver. This is 18.5.1 or higher for Windows, or 18.10 for Linux. RenderDoc V1.1 now supports the interop feature, available here: https://renderdoc.org/
- Shows presentation indicators in the System Activity section (reliant on latest driver)
- Adds feature to find an event’s parent command buffer (RGP panes –> System Activity)
- Adds feature to find a command buffer’s first child event (System Activity –> RGP panes)
- For driver-inserted barriers, shows the reason why the barrier was inserted. Shown on the barriers pane and in the details pane when an event is selected in one of the Events panes
- Adds support for Ryzen 5 and Ryzen 3 Processors with Radeon Vega Graphics (aka “Raven Ridge”)
- GPU only view option added to system activity view
- Placed system activity checkboxes inside new pulldowns: Workload views and CPU submission markers
- Color by command buffer added to Event Timing and wavefront occupancy timeline views
- Barriers & layout transitions extended to show whether they originated from the application or the driver. Text indicating the reason for the barrier will be shown if available
- Fixed-function work shown as part of an event in the Event timing pane and the timeline in the wavefront occupancy view
- Actionable context rolls, showing the state changes that caused them
- Improved wavefront occupancy resolution settings
- Support for PIX3 user markers
- Updated to use Qt5.9.2
- Bug/stability fixes & UI clean ups
- This is the first public release of the Radeon GPU Profiler
| RELATED TO RADEON™ GPU PROFILER
Links to AMD Presentations shown at GDC 2019
Learn what a context roll on our GPUs is, how they apply to the pipeline and how they’re managed, and what you can do to analyse them and find out if they’re a limiting factor in the performance of your game or application.
Links to AMD Presentations shown at GDC 2018
Due to architectural differences between Zen and our previous processor architecture, Bulldozer, developers need to take care when using the Windows® APIs for processor and core enumeration.
Sebastian Aaltonen, co-founder of Second Order Ltd, talks about how to optimize GPU occupancy and resource usage of compute shaders that use large thread groups.
A guide to using the Windows Performance Analyzer tool, with a focus on video resources.
| OUR OTHER TOOLS
Radeon™ Memory Visualizer (RMV) is a tool to allow you to gain a deep understanding of how your application uses memory for graphics resources.
This is a Visual Studio® Code extension for the Radeon GPU Analyzer (RGA). By installing this extension, it is possible to use RGA directly from within Visual Studio Code.
If you want to know how well a game is performing on your machine in real-time with low overhead, OCAT has you covered.
Radeon GPU Analyzer is an offline compiler and performance analysis tool for DirectX®, Vulkan®, SPIR-V™, OpenGL® and OpenCL™.
Compressonator is a set of tools to allow artists and developers to more easily work with compressed assets and easily visualize the quality impact of various compression technologies.