revise qwen2 into gradient checkpoint

5761e80
Merged

[Shardformer] change qwen2 modeling into gradient checkpointing style #5874

