Remove TMA padding for fwd inputs #85
dianzhangchen commented on Aug 22, 2025
- Remove padding requirement for FMHA/MLA fwd inputs.
- Fix bugs in corner cases (CUTLASS pull request #2480).
|
Hi @dianzhangchen, I ran into a significant difference between the flashmla and torch_attn outputs. Could you check whether you can reproduce it? It may be another corner case. |
I noticed the input sequence length is varlen, but the |
@dianzhangchen Sorry, my mistake: there is no precision issue when varlen is True.
I printed the size and offset in csrc/sm100/common/utils.hpp#L50, and they show an extremely large value. (BTW, this is the first issue I encountered. During my initial attempt to reproduce it, I mistakenly set varlen to False and was therefore unable to reproduce the problem. 😥) |
The error here doesn’t seem to be caused by the fwd kernel. You may open a new issue to report this. Thanks. |
Sure, I opened another issue, #90. Thanks for looking into it! |



