[Shardformer] change qwen2 modeling into gradient checkpointing style #5874

Merged
ver217 merged 1 commit into hpcaitech:main from CjhHa1:qwen2_ckpt on Jul 1, 2024

Commits on Jul 1, 2024