Add Sparse Attention Kernels on Hopper #98

Merged
interestingLSY merged 4 commits into main from open-source-h on Sep 29, 2025
Conversation

@interestingLSY (Collaborator)

Add sparse attention kernels for DeepSeek-V3.2 on the Hopper architecture. This includes:

  • Sparse attention kernel for the prefill stage
  • Sparse attention kernel for the decoding stage, with FP8 KV cache and paged attention support
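The decoding-stage kernel combines two ideas: only a sparse subset of past tokens is attended to, and the KV cache is stored in fixed-size pages addressed through a page table. Below is a minimal NumPy reference sketch of that semantics, not the actual Hopper kernel from this PR; all names and shapes are illustrative, and the FP8 quantization of the KV cache is omitted (it would amount to dequantizing each gathered page with a per-page scale before the matmuls).

```python
import numpy as np

def sparse_paged_decode_attention(q, k_pages, v_pages, page_table,
                                  topk_idx, page_size):
    """Reference semantics for sparse decode attention over a paged KV cache.

    q          : [D]                      query for the current token
    k_pages    : [num_phys_pages, page_size, D]  paged key cache
    v_pages    : [num_phys_pages, page_size, D]  paged value cache
    page_table : [num_logical_pages]      logical page -> physical page
    topk_idx   : [S]                      sparse set of token positions to attend to
    """
    D = q.shape[0]
    # Translate each selected token position through the page table
    # (this is the "paged attention" indirection).
    phys = page_table[topk_idx // page_size]   # physical page per token
    offs = topk_idx % page_size                # slot within the page
    k = k_pages[phys, offs]                    # [S, D] gathered keys
    v = v_pages[phys, offs]                    # [S, D] gathered values

    # Standard scaled-dot-product attention over only the selected tokens.
    scores = k @ q / np.sqrt(D)                # [S]
    scores -= scores.max()                     # numerically stable softmax
    w = np.exp(scores)
    w /= w.sum()
    return w @ v                               # [D] attention output
```

A real kernel would fuse the gather, softmax, and weighted sum, keep everything in shared memory/registers, and read FP8 keys/values with on-the-fly dequantization; the sketch above only pins down what the fused kernel must compute.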

interestingLSY merged commit 1794455 into main on Sep 29, 2025
interestingLSY deleted the open-source-h branch on September 29, 2025 at 10:06
@faruknane

@interestingLSY does it support the Blackwell architecture as well?


2 participants