AMD GPU Programming Training Course

ROCm is an open-source platform for GPU programming that supports AMD GPUs while also offering compatibility with CUDA and OpenCL. ROCm provides programmers with direct access to hardware details, granting full control over the parallelization process. However, this requires a solid understanding of device architecture, memory models, execution models, and optimization techniques.

HIP is a C++ runtime API and kernel language that enables developers to write portable code capable of running on both AMD and NVIDIA GPUs. It offers a lightweight abstraction layer over native GPU APIs, such as ROCm and CUDA, allowing users to leverage existing GPU libraries and tools.

This instructor-led, live training (available online or onsite) is designed for beginner to intermediate-level developers who want to use ROCm and HIP to program AMD GPUs and harness their parallel processing capabilities.

Upon completing this training, participants will be able to:

Set up a development environment that includes the ROCm Platform, an AMD GPU, and Visual Studio Code.
Develop a basic ROCm program that performs vector addition on the GPU and retrieves results from GPU memory.
Utilize the ROCm API to query device information, allocate and deallocate device memory, transfer data between the host and device, launch kernels, and synchronize threads.
Use the HIP language to write kernels that execute on the GPU and manipulate data.
Apply HIP built-in functions, variables, and libraries to perform common tasks and operations.
Leverage ROCm and HIP memory spaces—such as global, shared, constant, and local—to optimize data transfers and memory access.
Employ ROCm and HIP execution models to manage threads, blocks, and grids, which define the level of parallelism.
Debug and test ROCm and HIP programs using tools like ROCm Debugger and ROCm Profiler.
Optimize ROCm and HIP programs using techniques such as coalescing, caching, prefetching, and profiling.

Format of the Course

Interactive lectures and discussions.
Extensive exercises and practice sessions.
Hands-on implementation in a live-lab environment.

Course Customization Options

To request customized training for this course, please contact us to arrange.

28 hours

Taipei- Concord

39600 NT$ (Online)

39600 NT$ (Classroom)

AMD GPU Programming Training Course

Course Outline

Requirements

Upcoming Courses

AMD GPU Programming

AMD GPU Programming

AMD GPU Programming

AMD GPU Programming

AMD GPU Programming

Related Categories

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites

AMD GPU Programming Training Course

Course Outline

Requirements

Upcoming Courses

AMD GPU Programming

AMD GPU Programming

AMD GPU Programming

AMD GPU Programming

AMD GPU Programming

Related Courses

Developing AI Applications with Huawei Ascend and CANN

Deploying AI Models with CANN and Ascend AI Processors

AI Inference and Deployment with CloudMatrix

GPU Programming on Biren AI Accelerators

Cambricon MLU Development with BANGPy and Neuware

Introduction to CANN for AI Framework Developers

CANN for Edge AI Deployment

Understanding Huawei’s AI Compute Stack: From CANN to MindSpore

Optimizing Neural Network Performance with CANN SDK

CANN SDK for Computer Vision and NLP Pipelines

Building Custom AI Operators with CANN TIK and TVM

Migrating CUDA Applications to Chinese GPU Architectures

Performance Optimization on Ascend, Biren, and Cambricon

Related Categories

GPU

This site in other countries/regions

Europe

Asia Pacific

North America

South America

Africa / Middle East

Other sites