fix: NotImplementedError: aten::equal on meta tensors during multi-GPU init #1769

Merged
akoumpa merged 5 commits into NVIDIA-NeMo:main from harshareddy832:fix/1765-notimplementederror-aten-equal-on-meta-tensors-dur on Apr 13, 2026

Conversation

@harshareddy832
Contributor

Fixes #1765

Changes:

…gpu model init with transformers >= 5.4.0 (NVIDIA-NeMo#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>
@copy-pr-bot

copy-pr-bot Bot commented Apr 10, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

harshareddy832 changed the title from "NotImplementedError: aten::equal on meta tensors during multi-GPU model init with transformers >= 5.4.0" to "fix: NotImplementedError: aten::equal on meta tensors during multi-GPU model init with transformers >= 5.4.0" on Apr 10, 2026
…gpu model init with transformers >= 5.4.0 (NVIDIA-NeMo#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>
harshareddy832 requested a review from a team as a code owner on April 10, 2026 16:57
@akoumpa
Contributor

akoumpa commented Apr 10, 2026

/ok to test 421e37b

@akoumpa
Contributor

akoumpa commented Apr 13, 2026

/ok to test 1e2023f

@akoumpa
Contributor

akoumpa commented Apr 13, 2026

akoumpa merged commit ea97177 into NVIDIA-NeMo:main on Apr 13, 2026
43 of 45 checks passed
svcnvidia-nemo-ci pushed a commit that referenced this pull request Apr 13, 2026
…GPU init (#1769)

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>

* Add pull_request types to workflow triggers

* Apply suggestion from @akoumpa

---------

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
akoumpa added a commit that referenced this pull request Apr 13, 2026
…ti-GPU init (1769)` into `r0.4.0` (#1797)

fix: `NotImplementedError: aten::equal` on meta tensors during multi-GPU init (#1769)

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (#1765)

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (#1765)

* Add pull_request types to workflow triggers

* Apply suggestion from @akoumpa

---------

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>
Signed-off-by: NeMo Bot <nemo-bot@nvidia.com>
Co-authored-by: Harsha Pasham <53609097+harshareddy832@users.noreply.github.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
edjson pushed a commit to edjson/Automodel that referenced this pull request Apr 17, 2026
…GPU init (NVIDIA-NeMo#1769)

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (NVIDIA-NeMo#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (NVIDIA-NeMo#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>

* Add pull_request types to workflow triggers

* Apply suggestion from @akoumpa

---------

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
edjson pushed a commit to edjson/Automodel that referenced this pull request Apr 18, 2026
…GPU init (NVIDIA-NeMo#1769)

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (NVIDIA-NeMo#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (NVIDIA-NeMo#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>

* Add pull_request types to workflow triggers

* Apply suggestion from @akoumpa

---------

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Signed-off-by: Edison <edisonggacc@gmail.com>
linnanwang pushed a commit that referenced this pull request Apr 24, 2026
…GPU init (#1769)

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>

* fix: `notimplementederror: aten::equal` on meta tensors during multi-gpu model init with transformers >= 5.4.0 (#1765)

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>

* Add pull_request types to workflow triggers

* Apply suggestion from @akoumpa

---------

Signed-off-by: Harsha Pasham <pashamharsha018@gmail.com>
Co-authored-by: Alexandros Koumparoulis <153118171+akoumpa@users.noreply.github.com>
Labels

community-request
r0.4.0 (Auto-cherrypick to release branch. Apply before merge; cherrypick happens after merge.)

Development

Successfully merging this pull request may close these issues.

NotImplementedError: aten::equal on meta tensors during multi-GPU model init with transformers >= 5.4.0

4 participants