@rraminen

Enabled the 8.3B-parameter Megatron-LM-v1.1.5-ZeRO3 model on 8 GPUs with MP_SIZE=1.

To run:
cd DeepSpeedExamples/Megatron-LM-v1.1.5-ZeRO3
bash examples/ds_pretrain_gpt2-zero3_8.3B_params.sh

@jithunnair-amd

@rraminen Please post a link to the samples/sec numbers for the old and new Megatron when you have them.
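
For comparing the two runs, samples/sec can be derived as the global batch size divided by the time per iteration in seconds. The following is only a minimal sketch: it assumes both values are read off the training log by hand, and the helper name `samples_per_sec` and the example numbers are hypothetical, not taken from this PR.

```shell
# Hypothetical helper (not part of the repo): throughput in samples/sec.
# Usage: samples_per_sec <global_batch_size> <elapsed_ms_per_iteration>
samples_per_sec() {
  # Convert milliseconds to seconds, then divide batch size by iteration time.
  awk -v b="$1" -v ms="$2" 'BEGIN { printf "%.2f\n", b / (ms / 1000) }'
}

# Example with made-up values: 512 samples per iteration, 4000 ms per iteration.
samples_per_sec 512 4000
```

Running the same calculation on logs from the old and new Megatron, with the same batch size and GPU count, gives directly comparable figures.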

@jithunnair-amd jithunnair-amd merged commit a62a5b9 into ROCm:master Nov 18, 2021
