Skip to content

fix(moe): handle D < tp_size in fp8 _load_w13/_load_w2

969d564
Select commit
Loading
Failed to load commit list.
Draft

feat: add Step-3.5-Flash support and fix MoE weight shuffling on gfx950 #641

fix(moe): handle D < tp_size in fp8 _load_w13/_load_w2
969d564
Select commit
Loading
Failed to load commit list.