[Shardformer] change qwen2 modeling into gradient checkpointing style #5874
Merged
ver217 merged 1 commit into hpcaitech:main on Jul 1, 2024