Register pressure in AMD CDNA2™ GPUs – amd-lab-notes
Register pressure of GPU kernels has a tremendous impact on performance. This post provides a practical demo on applying recommendations.
Register pressure of GPU kernels has a tremendous impact on performance. This post provides a practical demo on applying recommendations.
This post gives an overview of AMD’s open source profiling tools, helping you diagnose bottlenecks and understand how your application is using the hardware.
This post introduces commonly-used memory spaces, identifies what makes each memory space unique, and discusses some common use-cases for each space.