Remove tests with inner sharded dimensions when using MultiDeviceExecutor#4470
Remove tests with inner sharded dimensions when using MultiDeviceExecutor#4470
Conversation
|
Review updated until commit 3be85a4 Description
Changes walkthrough 📝
PR Reviewer Guide 🔍Here are some key observations to aid the review process:
|
| shardTensor(ref_unsharded_outputs[i].as<at::Tensor>(), output_tv); | ||
| auto obtained_output = outputs[i].as<at::Tensor>(); | ||
|
|
||
| EXPECT_EQ(ref_output.strides(), obtained_output.strides()) << "Strides are not equal: Ref: " << ref_output.strides() << " Output: " << obtained_output.strides() << std::endl; |
There was a problem hiding this comment.
torch::allclose checks for sizes but not strides, so leaving this verification here for future.
|
!test |
|
!build |
Prep PR for Issue #3900.
I am modifying the
reorderShardedAxisPassto set allocation domain consistent with the memory layout requirements of ProcessGroup NCCL and UCC, without changing the logical shape (see PR #4170 for example).MultiDeviceExecutor does not respect allocation domain, hence, removing these tests. Issue #4453.