[Auto] Add DEIMv2 (cluster-41211-3): merged 1 of 2 PRs by evalstate · Pull Request #18 · evalstate/transformers

evalstate · 2026-04-23T22:50:10Z

Automated cluster merge for cluster-41211-3.

Merged PRs:

model: Add DEIMv2 to Transformers huggingface/transformers#44339 — Merged cleanly locally; active canonical DEIMv2 PR with recent maintainer approval and green targeted checks.

Skipped PRs:

Add DEIMv2 model, image processor, and basic tests huggingface/transformers#41356 — Superseded by model: Add DEIMv2 to Transformers huggingface/transformers#44339; stale incomplete DEIMv2 implementation and add/add conflicts on the same model/docs/test files when attempted after model: Add DEIMv2 to Transformers huggingface/transformers#44339.

Failed PRs:

None.

Notes:

Verified work was done on branch merge-cluster-cluster-41211-3-20260423223633 in the requested repo path.
Cluster issue Add DEIMv2 huggingface/transformers#41211 is still open.
Fetched PR heads from upstream as local refs pr-41356 and pr-44339.
Local merge commit created for model: Add DEIMv2 to Transformers huggingface/transformers#44339: 76d5c5d.
Attempted merge of Add DEIMv2 model, image processor, and basic tests huggingface/transformers#41356 was aborted cleanly after conflicts in DEIMv2 files.
Branch is left coherent with no active merge; pre-existing untracked agent/log files were left untouched.

Next steps:

Review the merged branch against upstream/main before any further action, since model: Add DEIMv2 to Transformers huggingface/transformers#44339's synced branch brought in upstream history.
If validating locally, run targeted DEIMv2 checks such as make style and the DEIMv2 test subset.

* Fix KV dedup for decode batches * Fix memory estimation * Change default * Added write-only fast path * Take both peaks into account * Revert unused config field * Review 1 * Fix p1s * Fix p2s and p3s that needed it * Added a TODO * Fix test, lower max cached graph, add TODO * Fix fragmentation with big warmup * Add more space for logits processors * Fix

@IlyasMoutawwakil

* Allow for registered experts from kernels hub * remove deepgemm as that is also dynamic * Apply repo consistency fixes * Update src/transformers/modeling_utils.py * Update src/transformers/modeling_utils.py * Apply repo consistency fixes * Apply suggestion from @IlyasMoutawwakil * Apply repo consistency fixes * get rid of triton dependency * keep eager first --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com> Co-authored-by: IlyasMoutawwakil <moutawwakil.ilyas.tsi@gmail.com>

* docs * feedback

update expectations for gemma3n

Summary: 1. fix torchao NVFP4 serialization with transformers 2. add a test to cover the fix While i'm here, also did the following bundled into this PR: 3. make the torchao serialization test have human readable names (easier to debug) 4. fix the float8 test (update the expected output) after this PR the test command for all torchao configs passes on an NVIDIA B200 Test Plan: ``` RUN_SLOW=1 pytest tests/quantization/torchao_integration/test_torchao.py -k "Serialization" -s ``` Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* added sonic moe * use lazy_load_kernel * style * use concatenated revision * final touches * fix * merge conflict * simpler naming * style * add sonicmoe test * skip fp32 on sonic * add transposed support * fix --------- Co-authored-by: vasqu <antonprogamer@gmail.com>

fix: continue when content is a string

* qa: bumped mlinter and allow local override * bump version * Update utils/check_modeling_rules_doc.py Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com> * license header * license header --------- Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

evalstate · 2026-04-23T22:50:28Z

Trace for this mergeability run: https://huggingface.co/datasets/evalstate/transformers-merge-experiments/blob/main/2604232346-Cj5qGC__dev__codex.jsonl

harshaljanjani and others added 30 commits February 27, 2026 22:06

init: Add files (v1)

eaef822

fix: Fix ci/circleci: check_repository_consistency

ddc1bd7

feat: Add support and test harness for all variants

85c7356

fix: Fix ci/circleci: check_repository_consistency

adc4079

Merge branch 'main' into add-deimv2

81a3d06

refactor: Resolve review comments

39d300e

Merge branch 'main' into add-deimv2

476d69f

refactor: Resolve second review round

4ad0dc5

nit: Fix copyright year

16f2d07

Merge branch 'main' into add-deimv2

78eaf93

Merge branch 'main' into add-deimv2

dbe577b

Merge branch 'main' into add-deimv2

1259628

refactor: Resolve third review round

31ee908

revert: Adhere to the pattern from yonigozlan

4a3a877

Merge branch 'main' into add-deimv2

558c2af

nit: Clarify the docstring

ada78bf

refactor: Resolve fourth review round

496ce9c

Merge branch 'main' into add-deimv2

5a12a56

Merge branch 'main' into add-deimv2

85b4079

refactor: Closing in on the final set of nits

422a440

Merge branch 'main' into add-deimv2

f932158

fix: Resolve merge conflicts

b833ee3

fix: Add loss override and address nits

58a6424

nits: Fix minor issues

7dd0fb1

fixup their init weights

943f4bb

Merge branch 'main' into add-deimv2

6213518

[docs] multi-turn tool calling (huggingface#45554)

bd69ed2

* docs * feedback

[AMD CI] Fix expectations for Gemma3n (huggingface#45602)

8e64e53

update expectations for gemma3n

vkuzo and others added 7 commits April 23, 2026 12:22

Processing Utils: continue when content is a string (huggingface#45605)

1e071b2

fix: continue when content is a string

fix: Fix loss coupling issue

fb1f387

Merge branch 'main' into add-deimv2

3629f13

Merge PR huggingface#44339: model: Add DEIMv2 to Transformers

76d5c5d

evalstate closed this Apr 24, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Auto] Add DEIMv2 (cluster-41211-3): merged 1 of 2 PRs#18

[Auto] Add DEIMv2 (cluster-41211-3): merged 1 of 2 PRs#18
evalstate wants to merge 37 commits intomainfrom
merge-cluster-cluster-41211-3-20260423223633

evalstate commented Apr 23, 2026

Uh oh!

evalstate commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

11 participants

Conversation

evalstate commented Apr 23, 2026

Uh oh!

evalstate commented Apr 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

11 participants