CUDA & GPGPU Programming

4月2日修改

2023年9月30日创建

2084

2382

Links

Lecture 26: GPU Programming (Fall 2022)

CUDA C++ Programming Guide

Berkeley - Understanding Latency Hiding on GPUs

August 12, 2016

128 页 pdf

reusable software components for every layer of the CUDA programming model

Summary

需要掌握的要点：

GPU 架构入门，cuda framework & 基本术语。

grid/block/thread 的基本概念。

kernel function 声明 & kernel launch

sm / sp 的 hardware execution model

warp，warp scheduler & warp-synchronicity

性能加速方法，总结为两个方向：

IO优化（提高cache命中、使用share memory、register等）

并行规划（让计算与IO、IO与IO、计算与计算互相overlap）

memory copy

stream & async copy

CUDA & GPGPU Programming​