data:image/s3,"s3://crabby-images/cd503/cd5039d29ec29f22ae06b29a3ce1dae561fdf590" alt="Logo for AMD FidelityFX Parallel Sort"
AMD FidelityFX™ Parallel Sort provides an open source header implementation to easily integrate a highly optimized compute-based radix sort into your game.
Supports:
- DirectX®12
- Vulkan®
Part of the AMD FidelityFX™ SDK
data:image/s3,"s3://crabby-images/a35c6/a35c64229136e6e61874c535e091968283462e00" alt="AMD FidelityFX SDK components"
Download the latest version as part of FidelityFX SDK v1.1
AMD FidelityFX Parallel Sort:
Clean up of GPU particle code to make better use of FidelityFX Parallel Sort.
Updated as part of AMD FidelityFX SDK v1.1:
- AMD FidelityFX backend updates, including buffer allocator overrides
- Updated documentation and release of reference documentation for SDK + Framework
- Native Microsoft® GDK® backend implementation library (requires developer access to GDK® program)
Features
State-of-the-art algorithm
Optimized for Shader Model 6.0+
Open source, MIT license
Additional features:
- Direct and Indirect execution support.
- AMD RDNA™+ optimized algorithm.
- 32-bit key and payload sort support
- Support for DirectX®12 API and Vulkan®.
- Shaders written in HLSL utilizing Shader Model 6.0 wave-level operations.
A sample application is provided for DirectX®12 and Vulkan®.
Details
Algorithm overview
AMD FidelityFX Parallel Sort is an AMD RDNA™-optimized version of the Radix Sort algorithm.
At a high level, the algorithm works by recursing over a data set to be sorted (key or key/value pairs), and re-arranging it in place by 4-bit increments. Each pass guarantees that the data set is fully sorted up to the number of bits processed. For example, after 4 iterations, we are guaranteed that the first 16 bits of the key is properly sorted.
For each iteration that is executed, 5 actions are taken on the data set:
- The 4-bit value range we are currently sorting is summed up into buckets from 0-15, so that we know how many of each value occurs throughout the data set.
- The number of occurrences go through a reduction phase in order to pre-increment offsets on a thread group basis later on.
- The reduced occurrences go through a scan-prefix to calculate offset values for each value group (0-15) on a thread group basis.
- The full occurrences buffer then also goes through a scan-prefix, and adds the reduced scan-prefix values to properly index the data across all thread groups.
- The data set is read in one more time, and written to its new sorted offset location. If there is also a payload, it is also copied over at this time.
Once all iterations have run (in the case of 32-bit keys, it runs 8 times), the entire data set is sorted.
Additional resources
Comparison: GPU particle sorting
data:image/s3,"s3://crabby-images/4c08d/4c08d1ba3f58e00fb75984c774fa50b964c33abf" alt="Unsorted GPU particles"
data:image/s3,"s3://crabby-images/aacf2/aacf23e25cccfcc3b08ee4dfc2327314afa5e0d5" alt="Sorted GPU particles"
Comparison: image index buffer sorting
data:image/s3,"s3://crabby-images/c7210/c72105810ec0f42c0a71dcb982d38c263b138803" alt="Unsorted image index buffer"
data:image/s3,"s3://crabby-images/44535/445351fbd4c752e12905c53106537ae3dee0c8c0" alt="Sorted image index buffer"
Version history
AMD FidelityFX Parallel Sort:
Clean up of GPU particle code to make better use of FidelityFX Parallel Sort.
Updated as part of AMD FidelityFX SDK v1.1:
- AMD FidelityFX backend updates, including buffer allocator overrides
- Updated documentation and release of reference documentation for SDK + Framework
- Native Microsoft® GDK® backend implementation library (requires developer access to GDK® program)
- Now part of AMD FidelityFX SDK.
- Various fixes applied.
- Vulkan® implementation.
- Upgraded framework to Cauldron v1.4.
- General code cleanup and readability improvements.
- Initial release
Part of the AMD FidelityFX SDK
data:image/s3,"s3://crabby-images/c0906/c09067bb55096ce71616a27d2ae5ec7a2231e21c" alt="AMD FidelityFX SDK"
AMD FidelityFX™ SDK
The AMD FidelityFX SDK is our easy-to-integrate solution for developers looking to include FidelityFX features into their games.
More AMD FidelityFX effects
data:image/s3,"s3://crabby-images/31aae/31aaeed2fb9520231e36f3e193fccbf587f2a725" alt="An image of a toy train blurring into the background"
AMD FidelityFX™ Blur
AMD FidelityFX Blur is an AMD RDNA™ architecture optimized collection of blur kernels from 3×3 up to 21×21.
data:image/s3,"s3://crabby-images/c517c/c517cf160a18c3d9f21574e28f67c85dc0745632" alt="Breadcrumbs-type image"
AMD FidelityFX™ Breadcrumbs library
AMD FidelityFX Breadcrumbs library uses the breadcrumbs marker technique to track down where your submitted commands cause a GPU crash.
data:image/s3,"s3://crabby-images/cc711/cc711e32b6189342169d9700bf33118b28086c5d" alt="AMD FidelityFX Brixelizer GI"
AMD FidelityFX™ Brixelizer/GI
AMD FidelityFX™ Brixelizer GI is compute-based real-time dynamic global illumination solution built upon sparse distance fields.
data:image/s3,"s3://crabby-images/fc585/fc5852d7daaeb24ad777ad24b42a4733441b3fa4" alt=""
AMD FidelityFX™ Cauldron Framework
AMD FidelityFX Cauldron Framework is our open-source experimentation framework for DirectX®12 and Vulkan®, provided in the AMD FidelityFX SDK.
data:image/s3,"s3://crabby-images/e4463/e446399680095fc1e13fbda62d7a20b0b7763c0c" alt=""
AMD FidelityFX™ Combined Adaptive Compute Ambient Occlusion (CACAO)
AMD FidelityFX Combined Adaptive Compute Ambient Occlusion (CACAO) is an AMD RDNA™ architecture optimized implementation of ambient occlusion.
data:image/s3,"s3://crabby-images/2c553/2c553951cfd0a97c6d98fc42c2038abf9b1eea0d" alt="AMD FidelityFX CAS demonstrated by a plane landing"
AMD FidelityFX™ Contrast Adaptive Sharpening (CAS)
AMD FidelityFX Contrast Adaptive Sharpening (CAS) provides a mixed ability to sharpen and optionally scale an image.
data:image/s3,"s3://crabby-images/2612a/2612ad7e09daa4bc40170c38dfb43aaacc296520" alt="Shadow Denoiser"
AMD FidelityFX™ Denoiser
AMD FidelityFX Denoiser is a set of denoising compute shaders which remove artefacts from reflection and shadow rendering.
data:image/s3,"s3://crabby-images/c2224/c22249601fcfeab9b8a75e36fb5534f68fde1de5" alt="FidelityFX Depth of Field - some items on a bench"
AMD FidelityFX™ Depth of Field (DoF)
AMD FidelityFX Depth of Field is an AMD RDNA™-architecture optimized implementation of physically correct camera-based depth of field.
data:image/s3,"s3://crabby-images/dc59e/dc59e1fe105b98d0d8af3ce0823875e012bcb2c7" alt="AMD FidelityFX Hybrid Shadows"
AMD FidelityFX™ Hybrid Shadows sample
This sample demonstrates how to combine ray traced shadows and rasterized shadow maps together to achieve high quality and performance.
data:image/s3,"s3://crabby-images/f8ddd/f8dddeb26a8f15f9ed3f420a287752902f8c6450" alt=""
AMD FidelityFX™ Hybrid Stochastic Reflections sample
This sample shows how to combine AMD FidelityFX Stochastic Screen Space Reflections (SSSR) with ray tracing in order to create high quality reflections.
data:image/s3,"s3://crabby-images/43eb5/43eb5d7a061bc815b238a98ec0d8c50eefee9d6f" alt="FidelityFX Lens - bar room"
AMD FidelityFX™ Lens
AMD FidelityFX Lens is an AMD RDNA™ architecture optimized implementation of some of gaming’s most used post-processing effects.
data:image/s3,"s3://crabby-images/c4b1b/c4b1bbb7166ff7a163f07aa41d356a1626e3aacb" alt=""
AMD FidelityFX™ Luminance Preserving Mapper (HDR Mapper)
AMD FidelityFX LPM provides an open-source library to easily integrate HDR and wide gamut tone and gamut mapping into your game.
data:image/s3,"s3://crabby-images/4cee8/4cee85ca226cb150b89b99e13bf2f71952ca2991" alt="Smaller texture images representing AMD FidelityFX SPD"
AMD FidelityFX™ Single Pass Downsampler (SPD)
AMD FidelityFX Single Pass Downsampler (SPD) provides an AMD RDNA™ architecture optimized solution for generating up to 12 MIP levels of a texture.
data:image/s3,"s3://crabby-images/882e8/882e84fd436a464f7dfbbd77e435971d1e3f0f09" alt="Chess pieces showing reflections depicting AMD FidelityFX Screen Space Reflections"
AMD FidelityFX™ Stochastic Screen Space Reflections (SSSR)
The AMD FidelityFX SSSR effect provides an open-source library to easily integrate stochastic screen space reflections into your game.
data:image/s3,"s3://crabby-images/a38e0/a38e0ed82f3a8358de42682b3946c802065532f4" alt="FSR Cauldron Demo"
AMD FidelityFX™ Super Resolution 1 (FSR 1)
AMD FidelityFX Super Resolution (FSR) is our open-source, high-quality, high-performance upscaling solution.
data:image/s3,"s3://crabby-images/b0154/b01544a862727ce6af3ea03a65e703277bcf1b9c" alt=""
AMD FidelityFX™ Super Resolution 2 (FSR 2)
Learn even more about our new open-source temporal upscaling solution FSR 2, and get the source code and documentation!
data:image/s3,"s3://crabby-images/c97ad/c97adaf5d3ab5d8022ec8966b88bde63850ceb87" alt="AMD FidelityFX Super Resolution 3 spaceship"
AMD FidelityFX™ Super Resolution 3 (FSR 3)
Discover frame generation with AMD FidelityFX™ Super Resolution 3, and get the source code and documentation!
data:image/s3,"s3://crabby-images/d0e63/d0e63ad63b7f180aab68b49c90142d425651fe74" alt="Hangar deck demonstrating AMD FidelityFX Variable Shading"
AMD FidelityFX™ Variable Shading
AMD FidelityFX Variable Shading drives Variable Rate Shading into your game.
Other effects on GPUOpen
data:image/s3,"s3://crabby-images/ab384/ab384baa65d99e564c9e76195579437ab2a8426e" alt=""
TressFX
The TressFX library is AMD’s hair/fur rendering and simulation technology. TressFX is designed to use the GPU to simulate and render high-quality, realistic hair and fur.