@mengqin mengqin commented Dec 1, 2025

This PR adds basic Sage Attention 3 support. Because Sage Attention 3 is still unstable and differs significantly from earlier versions of Sage Attention, it is controlled by a separate switch, --use-sage-attiention3, rather than the existing Sage Attention option. Sage Attention 3 must be installed in your environment before you enable the switch.
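For reviewers who want a picture of the opt-in behaviour described above, here is a minimal sketch of how such a switch could be gated on the package being installed. It is not the code in this PR: the helper names `build_parser` and `sage3_enabled` are hypothetical, and the default module name `sageattn3` is an assumption about what the Sage Attention 3 package exposes. The flag spelling is copied from the description as written.

```python
import argparse
import importlib.util


def build_parser() -> argparse.ArgumentParser:
    """Build a parser exposing only the opt-in switch (hypothetical helper)."""
    parser = argparse.ArgumentParser()
    # Flag spelling copied verbatim from the PR description.
    parser.add_argument(
        "--use-sage-attiention3",
        action="store_true",
        help="Opt in to Sage Attention 3 (off by default).",
    )
    return parser


def sage3_enabled(argv, module_name="sageattn3"):
    """Return True only if the switch is set and the module can be found.

    module_name is an assumed import name, not confirmed by this PR.
    """
    args = build_parser().parse_args(argv)
    if not args.use_sage_attiention3:
        # Switch not given: stay on the existing attention path.
        return False
    if importlib.util.find_spec(module_name) is None:
        # Fail early instead of at the first attention call.
        raise RuntimeError(
            f"--use-sage-attiention3 was given but '{module_name}' is not installed"
        )
    return True
```

Checking availability with `importlib.util.find_spec` keeps the default path free of any Sage Attention 3 import, so users who never pass the switch are unaffected.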

Attention 3 takes effect is reduced to 1024 (although the improvement is
not significant at this scale).
@mengqin mengqin requested a review from guill as a code owner December 2, 2025 16:21
@mengqin mengqin requested a review from rattus128 December 2, 2025 16:26