[shardformer] sync gptj config with hf due to flash attn head dim req…

341effc
Merged

[shardformer] support GPTJ auto model sharding (except lazy init) #4825

Workflow runs completed with no jobs