Skip to content

[hotfix] Add layer norm gradients all-reduce for sequence parallel.#4915

Merged
littsk merged 4 commits intohpcaitech:hotfix/sequence_parallelfrom
littsk:hotfix/add_grad_all_reduce_for_sequence_parallel
Oct 16, 2023
Merged

[hotfix] Add layer norm gradients all-reduce for sequence parallel.#4915
littsk merged 4 commits intohpcaitech:hotfix/sequence_parallelfrom
littsk:hotfix/add_grad_all_reduce_for_sequence_parallel

Commits

Commits on Oct 16, 2023