
Hack to ensure cp.async is waited before smem reuse #2001

Merged
jacobhinkle merged 1 commit into main from hack_around_async_smem_reuse on Mar 26, 2024
Conversation

@jacobhinkle (Collaborator) commented Mar 26, 2024

This is a work-around for #2000.

It seems to address the issue in the only current use case for smem reuse: matmul with params.use_smem_epilogue == true. It is not ideal: for example, it will insert a cp.async.wait_all instruction even when circular buffering is not used in the kernel.

Fixes #1996 but since this is a hack, I will not mark #2000 as fixed yet.
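To make the hazard concrete, here is a hedged, illustrative CUDA sketch (not nvFuser's generated code; the kernel, buffer size, and copy layout are invented for illustration) of why a cp.async.wait_all must land before the shared-memory buffer is reused: cp.async copies are asynchronous, so a later write to the same smem region can race with in-flight loads.

```cuda
#include <cuda_runtime.h>

// Hypothetical kernel: copy data into smem asynchronously, compute, then
// reuse the same smem buffer (as a smem epilogue would).
__global__ void smem_reuse_sketch(const float* gmem_in, float* gmem_out) {
    __shared__ float smem[512];  // buffer later reused by the "epilogue"

    // Issue an asynchronous global->shared copy of 16 bytes (sm_80+ PTX).
    unsigned smem_addr =
        (unsigned)__cvta_generic_to_shared(&smem[threadIdx.x * 4]);
    asm volatile("cp.async.ca.shared.global [%0], [%1], 16;\n"
                 :: "r"(smem_addr), "l"(gmem_in + threadIdx.x * 4));

    // This is the wait the PR inserts: without it, the writes below could
    // race with the still-in-flight async copies into smem.
    asm volatile("cp.async.wait_all;\n");
    __syncthreads();

    // ... main computation reads smem here ...

    // Only after the wait (plus a barrier) is it safe to overwrite smem,
    // e.g. to stage the epilogue output.
    smem[threadIdx.x] = smem[threadIdx.x * 4] * 2.0f;
    __syncthreads();
    gmem_out[threadIdx.x] = smem[threadIdx.x];
}
```

Per the PTX ISA, cp.async.wait_all waits for completion of all prior cp.async operations by the thread, which is why inserting it unconditionally is safe but, as noted above, heavier than necessary when no circular buffering is in play.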

@jacobhinkle (Collaborator, Author) commented:

!build --diff-bench

@jacobhinkle jacobhinkle marked this pull request as ready for review March 26, 2024 16:44
@zasdfgbnm (Collaborator) left a comment


As a temporary hack to make CI green, this PR is good. But we still need to look into a better solution for this problem.

@jacobhinkle jacobhinkle merged commit 2f80cee into main Mar 26, 2024
@jacobhinkle jacobhinkle deleted the hack_around_async_smem_reuse branch March 26, 2024 22:10
jacobhinkle added a commit that referenced this pull request Mar 28, 2024
This just places a `cp.async.wait_group 0` instruction immediately after
any circular buffer main loop, which is the approach taken by CUTLASS for
pipelining GEMMs (see
[mma_multistage.h#L664-L665](https://github.com/NVIDIA/cutlass/blob/c4e3e122e266644c61b4af33d0cc09f4c391a64b/include/cutlass/gemm/threadblock/mma_multistage.h#L664-L665)).
The previous fix for #2000, #2001, is reverted.

This is an alternative to #2005.

Fixes #2000
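The placement described in the commit message can be sketched as follows. This is a hedged outline of a generic multistage-pipelined GEMM (the loop structure, stage counts, and variable names are assumptions for illustration, not nvFuser's or CUTLASS's actual generated code); the key point is the drain wait directly after the main loop, before the epilogue touches the circular-buffer smem.

```cuda
// Hypothetical multistage pipeline skeleton (num_stages, k_tiles assumed).

// Prologue: fill the pipeline, committing one cp.async group per stage.
for (int stage = 0; stage < num_stages - 1; ++stage) {
    // ... issue cp.async loads for this stage ...
    asm volatile("cp.async.commit_group;\n");
}

// Main loop over the circular buffer of shared-memory stages.
for (int k = 0; k < k_tiles; ++k) {
    // Wait until at most (num_stages - 2) groups remain in flight, so the
    // oldest stage is ready to consume.
    asm volatile("cp.async.wait_group %0;\n" :: "n"(2));
    __syncthreads();
    // ... mma on the ready stage; issue + commit loads for the next tile ...
}

// The fix: drain ALL in-flight copies immediately after the main loop,
// mirroring CUTLASS's mma_multistage, so the epilogue may reuse smem.
asm volatile("cp.async.wait_group 0;\n");
__syncthreads();

// Epilogue: safe to overwrite the circular-buffer smem here.
```

Compared with the unconditional cp.async.wait_all of #2001, this scopes the wait to kernels that actually have a circular-buffered main loop, which is why it supersedes the earlier hack.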


Development

Successfully merging this pull request may close these issues.

Some matmul nvfuser_splitk benchmark fails
