Skip to content

fix Dtensor and tensor mismatch for Col/RowRep#42924

Merged
ArthurZucker merged 25 commits intomainfrom
tp_replicate_interface
Dec 17, 2025
Merged

fix Dtensor and tensor mismatch for Col/RowRep#42924
ArthurZucker merged 25 commits intomainfrom
tp_replicate_interface

Conversation

@3outeille
Copy link
Copy Markdown
Member

same issue as #42906

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@ArthurZucker ArthurZucker merged commit 99be81e into main Dec 17, 2025
26 of 34 checks passed
@ArthurZucker ArthurZucker deleted the tp_replicate_interface branch December 17, 2025 15:39
SangbumChoi pushed a commit to SangbumChoi/transformers that referenced this pull request Jan 23, 2026
* begin Moe test tensor parallel

* create tiny moe model + fix test tensor parallel Moe

eaeaae

* create tiny moe model + fix test tensor parallel Moe

eaeaae

fix tensor parallel MoE test
fix tensor parallel MoE test

* fix backward pass test in tensor parallel for Dense model (huggingface#42811)

* fix

* linting

* use mixtral instead for testing

* fix dtensor and tensor mismatch

* linting

* checkout test tensor parallel to be like main

* avoid hack and create class instead

* fix loading ep

* add moe test

* now EP inference works again but pass still fails

* Add ColwiseParallelReplicate and RowwiseParallelReplicate classes for replicated layouts

* clean

* eaza

* aeaeaea

* eaeaa

* linting
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants