This repository documents the process of optimizing GEMM (general matrix multiplication) on NVIDIA Hopper GPUs, using techniques such as WGMMA, TMA, a producer-consumer pipeline, and a persistent kernel.

HPC4AI/HopperGEMM
