Skip to content

Support MoE per-expert finalize input scales#31

Draft
oazizi000 wants to merge 3 commits into
gimlet_patches/v1.3.0rc10from
oazizi/moe_finalize_scale
Draft

Support MoE per-expert finalize input scales#31
oazizi000 wants to merge 3 commits into
gimlet_patches/v1.3.0rc10from
oazizi/moe_finalize_scale

Conversation

@oazizi000
Copy link
Copy Markdown

Add an optional per-expert element-wise scale factor into the finalize kernel of blockScaleMoe

@oazizi000 oazizi000 marked this pull request as draft May 8, 2026 21:14
@oazizi000 oazizi000 force-pushed the oazizi/moe_finalize_scale branch from c5f2ee1 to 6df640f Compare May 13, 2026 19:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant