Conversation
|
!test |
|
Review updated until commit 846fbee Description
Changes walkthrough 📝
PR Reviewer Guide 🔍Here are some key observations to aid the review process:
|
|
!test |
1 similar comment
|
!test |
|
The |
|
!test |
|
!test |
|
!test |
csrc/multidevice/communication.cpp
Outdated
| !communication->out()->getDeviceMesh().has(communication->root())) { | ||
| input_tensors.front().push_back(output_tensor); | ||
| !output_has_root) { | ||
| input_tensors.front().push_back(at::empty_like(splits.at(0))); |
There was a problem hiding this comment.
do we necessarily need to allocate a new buffer for inputs here?
|
!test |
| testing::Values(CommunicatorBackend::kNccl, CommunicatorBackend::kUcc), | ||
| all_meshes, | ||
| all_meshes, | ||
| testing::Values(mesh0, mesh1), |
There was a problem hiding this comment.
Changing this to avoid cases where root is not in output device mesh.
|
!test |
Adds support for
scatterwhen using loop split for sharding.Prepares for #3900
Similar to allgather, for correct results,
scatterrequires outermost allocation of the scattered axis.