fixing default communication_data_type for bfloat16_enabled and docs#3370
fixing default communication_data_type for bfloat16_enabled and docs#3370tjruwase merged 3 commits intodeepspeedai:masterfrom
Conversation
|
@clumsy According to the discussion here #2911 (comment), it seems intentional that bf16's default communication_data_type is set to float32. I'm curious, did you encounter any issues when using bf16 + fp32 communication_data_type? |
|
Yes @Sanster, let me mention the regression issue we faced. Perhaps more importantly this Now back to your question of whether to default to fp32. In the spirit of |
@clumsy, thanks for sharing this explanation. Apologies for breaking BC with the above change. I am aligned with |
fixes #2071
#2145 fixed the case when
communication_data_typeis set explicitly, unfortunately the default value for whenbfloat16_enabled=Trueis notbfp16.Adding unit tests for default
communication_data_type.Correcting the docs that mentioned
communication_data_typeas being of typeboolean.Before the fix:
After the fix: