Skip to content
Professional Compute
  • GPUOpen
  • Games & CGI
  • Professional Compute
  • Archive
  • Like
    RDNA Shader Instruction Set Architecture document is now available

    This is a very short blog post to let everyone know that the RDNA Shader Instruction Set Architecture reference guide is now available. The document …

    14 4 08/02/2019
    CodeXL 2.6 is released!

    A new version of the CodeXL open-source developer tool is now available! What’s New? For current users of CodeXL, this new release may look and …

    10 2 10/15/2018
    ROCm Tensorflow 1.8 Release

      We are excited to announce the release of ROCm enabled TensorFlow v1.8 for AMD GPUs. This post demonstrates the steps to install and use …

    58 27 07/06/2018
    TrueAudio Next Version 1.2 Now Posted to Github

    We have posted the version 1.2 update to the TrueAudio Next open-source library to Github. It is available here. This update has a number of …

    11 2 05/10/2018
    V-EZ brings “Easy Mode” to Vulkan

    Why Vulkan? With advantages like reduced driver overhead and more control over GPUs, Vulkan has become the 3D graphics and compute API of choice for …

    24 0 03/26/2018
    Deferred Path Tracing By Enscape

    Insights from Enscape as to how they designed a renderer that produces path traced real time global illumination and can also converge to offline rendered image quality

    25 0 12/06/2017
    AMD Vega ISA (Instruction Set Architecture) documentation

    Understanding the instruction-level capabilities of any processor is a worthwhile endeavour for any developer writing code for it, even if the instructions that get executed …

    30 7 08/10/2017
    What’s new in HIP and HCC for ROCm 1.6

    I wanted to share an update on “what’s new” with HIP and HCC in ROCm 1.6: HIP has a new home We’re still on GitHub, …

    5 0 07/05/2017
    Developer Quick Start: MIOpen 1.0

    Overview Announcing our new Foundation for Deep Learning acceleration  MIOpen 1.0 which introduces support for Convolution Neural Network acceleration — built to run on top …

    9 5 07/01/2017
    Developer Quickstart: OpenCL on ROCm 1.6

    Overview ROCm 1.6 introduces big updates to our OpenCL compiler and runtime implementation — built on top of the ROCm software stack! This developer release includes …

    10 5 07/01/2017
    Open and Shut: The Case for AMD’s Open-Source Machine Intelligence Software Stack

    A revolution in machine learning In 2012, a research group from the University of Toronto led by Geoffrey Hinton created an earth-shattering advancement to the …

    13 5 07/01/2017
    We ported CAFFE to HIP – and here’s what happened…

    The Challenge CAFFE is a popular machine learning framework created by the Berkeley Vision and Learning Center.  The code base contains more than 55,000 lines of …

    9 1 07/01/2017
    CodeXL 2.3 is released!

    A new version of the CodeXL open-source developer tool is out! Here are the major new features in this release: CPU Profiling Support for AMD …

    7 0 05/10/2017
    Live VGPR Analysis with Radeon GPU Analyzer

    This article explains how to use Radeon GPU Analyzer (RGA) to produce a live VGPR analysis report for your shaders and kernels. Basic RGA usage …

    15 5 03/21/2017
    Using Sub DWord Addressing on AMD GPUs with ROCm

    Introduction Sub DWord Addressing is a feature of the AMD GCN architecture which allows the efficient extraction of 8-bit and 16-bit values from a 32-bit register.  …

    7 7 02/24/2017
    TrueAudio Next Demo and Paper at GameSoundCon

    In 2016, AMD brought TrueAudio Next to GameSoundCon. GameSoundCon was held Sept 27-28 at the Millennium Biltmore Hotel in Los Angeles. GameSoundCon caters to game …

    7 2 02/24/2017
    Using ROCm to leverage HBM: A Matrix-Vector Multiplication Case Study

    HBM The AMD Radeon™ R9 Fury Series graphics cards (Fury X, R9 Fury and the R9 Nano graphics cards) are the world’s first GPU family …

    7 13 11/22/2016
    ROCm 1.2 Rocking Hawaiian Style

    With ROCm 1.2 we are  moving beyond the Fiji Islands of GPU’s to bring a broader selection of  hardware with the inclusion of the Hawaii …

    13 3 09/13/2016
    News from the HIP part of the world

    We’ve been super-busy – so busy that it has been a while since I’ve been able to post. I wanted to pause long enough to …

    14 0 09/12/2016
    AMD GCN Assembly: Cross-Lane Operations

    Cross-lane operations are an efficient way to share data between wavefront lanes. This article covers in detail the cross-lane features that GCN3 offers.  I’d like …

    38 6 08/10/2016
    Blazing CodeXL 2.2 is here!

    A new release of the CodeXL open-source developer tool is out! Here’s the hot new stuff in this release: New platforms support Support Linux systems …

    22 5 08/08/2016
    Extending Support for In-Place Transpose to Compute FFTs Without Using Extra Memory

    Many fast Fourier transform (FFT) algorithms implement an intermediate transpose stage. Traditionally, the transpositions have used an out-of-place approach in the clFFT library – that …

    12 2 07/05/2016
    The Art of AMDGCN Assembly: How to Bend the Machine to Your Will

    The ability to write code in assembly is essential to achieving the best performance for a GPU program. In a previous blog we described how …

    21 9 06/29/2016
    ROCm with Rapid Harmony : Optimizing HSA Dispatch

    We previously looked at how to launch an OpenCL™ kernel using the HSA runtime. That example showed the basics of using the HSA Runtime. Here we’ll …

    6 6 06/15/2016
    HIP Release 0.86 Now Available

    The team just released an update to HIP in version 0.86 which includes several improvements in the functionality and tools. Also we have included several additional …

    10 0 06/09/2016
    ROCm With Harmony: Combining OpenCL, HCC, and HSA in a Single Program

    Introduction In a previous blog we discussed the different languages available on the ROCm platform.  Here we’ll show you how to combine several of these …

    12 8 06/03/2016
    CodeXL 2.1 is out and Searing hot with Vulkan

    A new CodeXL release is out! For the first time the AMD Developer Tools group worked on this release on the CodeXL GitHub public repository, …

    20 0 05/31/2016
    Turbocharge your Graphics and GPU Compute Applications with GPUPerfAPI

    Achieving high performance from your Graphics or GPU Compute applications can sometimes be a difficult task. There are many things that a shader or kernel …

    15 3 05/25/2016
    AMD DOPPEngine – Post Processing on Your Desktop in Practice

    A Complete Tool to Transform Your Desktop Appearance After introducing our Display Output Post Processing (DOPP) technology, we are introducing a new tool to change …

    37 0 05/23/2016
    Rocking ROCm-gdb’s New Features

    ROCm-gdb v1.0 includes new features to assist application developers with understanding their application’s behavior. To get started with ROCm-gdb follow the installation directions and introductory …

    14 3 04/26/2016
    Computing Very Large FFTs on AMD GPUs

    One of the exciting new features that is available in clFFT 2.10 is the ability to compute very large FFTs. By very large, I mean …

    13 2 04/26/2016
    ROCm, Do You Speak My Language?

    The open-source ROCm stack offers several programming-language choices. Overall, the goal is to give you a range of tools to help solve the problem at …

    16 6 04/25/2016
    Getting Started with ROCm: Components, Platforms & Installation

    Are You Ready to ROCK! The ROCm Platform delivers on the vision of the Boltzmann Initiative, bringing new opportunities in GPU Computing Research. On November …

    30 7 04/25/2016
    CodeXL 2.0 is Here and Open Source

    CodeXL major release 2.0 is out! It is chock-full of new features and a drastic change in the CodeXL development model: CodeXL is now open …

    26 6 04/19/2016
    Getting Up to Speed on the CodeXL GPU Profiler with Radeon Open Compute

    With the announcement of the Boltzmann Initiative and the recent releases of ROCK and ROCR, AMD has ushered in a new era of Heterogeneous Computing. …

    7 2 03/16/2016
    HIP release 0.82

    It’s been just under two months since we publicly launched the HIP repository, and I wanted to share a quick update on the work we’ve …

    9 6 03/14/2016
    Can you build a 1 Petaflop DNN or Molecular Dynamics Computing Solution with ROCm in Single Rack?

    It was a critical question we asked ourselves early in the project, but we also asked if we can bring together a solution where you …

    3 0 01/29/2016
    ROCm: Platform For A New Era of Heterogeneous in HPC and Ultrascale Computing

    The ROCm Platform Deliver on the Vison of the  Boltzmann Initiative,  Bringing a New Opportunities in GPU Computing Research On November 16th, 2015, the Radeon Technology …

    23 3 01/26/2016
    GPUOpen, an Uninhibited Path to Science Discovery, Exploring the Limits of Engineering, or Just Creating Your Artistic World of Wonder

    The Open Path to Bring Forward Your Ideas to High-Performance GPU Computing   Welcome to the new Portal I want to welcome you to the new …

    13 2 01/26/2016
    HSAIL GDB: HSAIL-level Debugger With AMD GCN Debug Technology

    Announcing HSAIL GDB Version 1.0 … Today as part of AMD’s GPUOpen initiative, we are happy to announce the release of HSAIL GDB version 1.0 …

    7 0 01/26/2016
    CodeXL Analyzer CLI – Open Source Announcement

    “AMD is releasing open source code for CodeXL Analyzer CLI. This is a performance analysis tool for OpenCL™ kernels, DirectX® shaders and OpenGL shaders. Using …

    12 0 01/26/2016
    HIP to be Squared : An Introductory HIP Tutorial

    This tutorial shows how to get started with HIP.  We’ll take a simple CUDA application, hipify it, and run it on multiple platforms. Editor’s note: …

    22 21 01/26/2016
    A Brief Intro to the Heterogeneous Compute Compiler

    In November, AMD launched the Boltzmann Initiative at Supercomputing 2015 with the goal of enabling developers to more easily employ the full compute potential of …

    11 14 01/26/2016
    Platform-Aware Coding Inside HIP

    Intro The “P” in HIP literally stands for portability – HIP’s full and formal name is the “Heterogeneous-computing Interface for Portability”.  However, even in a …

    5 5 01/26/2016

    HIP Release 0.86 Now Available

    Posted on June 9, 2016November 8, 2016 by Ben Sander
    C++, clang-hipify, G++, HCC, HIP, HIPCC

    The team just released an update to HIP in version 0.86 which includes several improvements in the functionality and tools. Also we have included several additional HIP ports and examples.

    If you’re just getting started, HIP (or Heterogeneous-Computing Interface for Portability) is a portable C++ runtime and kernel language for GPUs, and includes tools to “hipify” CUDA code into the portable C++ language.

    HIP 0.86 Release Notes

    Release:0.86.00
    Date: 2016.06.06

    • Add clang-hipify : clang-based hipify tool.  Improved parsing of source code, and automates
      creation of hipLaunchParm variable.
    • Implement memory register / unregister commands (hipHostRegister, hipHostUnregister)
    • Add cross-linking support between G++ and HCC, in particular for interfaces that use
      standard C++ libraries (ie std::vectors, std::strings).  HIPCC now uses libstdc++ by default on the HCC
      compilation path.
    • More samples including gpu-burn, SHOC, nbody, rtm.  See HIP-Examples.

    Also some info on the previous 0.84 release:

    Release:0.84.01
    Date: 2016.04.25

    – Refactor HIP make and install system:
    – Move to CMake. Refer to the installation section in README.md for details.
    – Split source into multiple modular .cpp and .h files.
    – Create static library and link.
    – Set HIP_PATH to install.
    – Make hipDevice and hipStream thread-safe.
    – Prefered hipStream usage is still to create new streams for each new thread, but it works even if you don;t.
    – Improve automated platform detection: If AMD GPU is installed and detected by driver, default HIP_PLATFORM to hcc.
    – HIP_TRACE_API now prints arguments to the HIP function (in addition to name of function).
    – Deprecate hipDeviceGetProp (Replace with hipGetDeviceProp)
    – Deprecate hipMallocHost (Replace with hipHostMalloc)
    – Deprecate hipFreeHost (Replace with hipHostFree)
    – The mixbench benchmark tool for measuring operational intensity now has a HIP target, in addition to CUDA and OpenCL.  Let the comparisons begin. 🙂

    See mixbench GitHub site for more.

    Related Resources

    • Technical Blogs
    • HIP GitHub Repository
    Ben Sander is a Senior Fellow at AMD and the lead software architect for the ROCm and HSA projects. He has held a variety of management and leadership roles during his career at AMD including positions in CPU micro-architecture, performance modeling, and GPU software development and optimization. Links to third party sites are provided for convenience and unless explicitly stated, AMD is not responsible for the contents of such linked sites and no endorsement is implied.

    0 Comments

    Comment

    Leave a Reply Cancel reply

    Your email address will not be published. Required fields are marked *

    • Terms and Conditions
    • /
    • Privacy
    • /
    • Cookie Policy
    • /
    • Trademarks
    ©2019 Advanced Micro Devices, Inc. OpenCL™ and the OpenCL™ logo are trademarks of Apple, Inc., used with permission by Khronos.