[Auto] Add DEIMv2 (cluster-41211-3): merged 1 of 2 PRs by evalstate · Pull Request #34 · evalstate/transformers

evalstate · 2026-04-27T11:59:56Z

Cluster: cluster-41211-3
Base ref: origin/main
Branch: merge-cluster-cluster-41211-3-20260427115403

Merged:

huggingface/transformers#44339 ("model: Add DEIMv2 to Transformers"): canonical, newer DEIMv2 implementation; merged cleanly in the local branch with no manual conflict resolution.

Skipped:

huggingface/transformers#41356 ("Add DEIMv2 model, image processor, and basic tests"): older duplicate DEIMv2 implementation, superseded by huggingface/transformers#44339; a conflict probe showed add/add conflicts on the overlapping DEIMv2 files.
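
The conflict probe itself is not shown in this report. As a rough sketch, assuming the probe captured the text output of a trial merge (for example from `git merge --no-commit` or `git merge-tree --write-tree`), add/add conflicts could be picked out of that output as follows. The function name and the sample paths are hypothetical, not taken from the PR.

```python
import re

# Git reports this kind of conflict as
# "CONFLICT (add/add): Merge conflict in <path>".
ADD_ADD = re.compile(
    r"^CONFLICT \(add/add\): Merge conflict in (?P<path>.+)$",
    re.MULTILINE,
)


def add_add_conflicts(merge_output: str) -> list[str]:
    """Return the paths that a trial merge reported as add/add conflicts."""
    return [m.group("path").strip() for m in ADD_ADD.finditer(merge_output)]


# Hypothetical probe output; the paths are placeholders.
sample = """\
Auto-merging docs/source/en/_toctree.yml
CONFLICT (add/add): Merge conflict in src/example/modeling_example.py
CONFLICT (content): Merge conflict in docs/source/en/_toctree.yml
CONFLICT (add/add): Merge conflict in tests/example/test_modeling_example.py
"""
conflicts = add_add_conflicts(sample)
```

Only the add/add lines are collected; content conflicts in files that already exist on both sides are ignored, since those would not indicate two PRs adding the same new files.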

Failed:

None.

Notes:

Issue huggingface/transformers#41211 ("Add DEIMv2") is open; the cluster contains PRs huggingface/transformers#44339 and huggingface/transformers#41356.
The local merge commit for huggingface/transformers#44339 is c81eeec.
The branch is ahead of origin/main by 37 commits after merging huggingface/transformers#44339.
No tests were run during this local mergeability pass.
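
One way to reproduce the "ahead by 37 commits" figure is `git rev-list --left-right --count origin/main...HEAD`, which prints the behind and ahead counts separated by whitespace. The sketch below only parses such output; the helper name is hypothetical, and the behind count of 0 in the sample is an assumption, since the report states only the ahead count.

```python
def parse_ahead_behind(rev_list_output: str) -> tuple[int, int]:
    """Parse `git rev-list --left-right --count origin/main...HEAD` output.

    Returns (behind, ahead): commits only on origin/main, then commits
    only on HEAD.
    """
    behind, ahead = (int(field) for field in rev_list_output.split())
    return behind, ahead


# Sample output: ahead=37 comes from the report; behind=0 is assumed.
behind, ahead = parse_ahead_behind("0\t37\n")
```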

Next steps:

Treat huggingface/transformers#44339 as the viable candidate for this cluster.
Do not merge huggingface/transformers#41356 unless maintainers explicitly want a distinct cherry-pick from the older implementation.
Run targeted DEIMv2 tests and repository style/check tooling before any upstream PR action.
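
The last step could be scripted along these lines. This is a sketch under assumptions: the test path `tests/models/deimv2` follows the usual per-model test layout but is not confirmed by this report, the `make style` target is assumed to exist in the repository's Makefile, and the helper name is hypothetical. The `RUN_SLOW=1` variable is the same one used in the torchao test plan quoted further down.

```python
import shlex


def deimv2_check_plan(test_path="tests/models/deimv2", run_slow=False):
    """Build the follow-up check commands; nothing is executed here."""
    # Extra environment variables to layer on top of the current environment.
    env = {"RUN_SLOW": "1"} if run_slow else {}
    commands = [
        ["python", "-m", "pytest", test_path, "-v"],  # targeted model tests (assumed path)
        ["make", "style"],  # repository style tooling (assumed target)
    ]
    return env, commands


env, commands = deimv2_check_plan(run_slow=True)
printable = [shlex.join(cmd) for cmd in commands]
```

Each command can then be passed to `subprocess.run(cmd, env={**os.environ, **env}, check=True)` so that the first failing check stops the run.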

* Fix KV dedup for decode batches
* Fix memory estimation
* Change default
* Added write-only fast path
* Take both peaks into account
* Revert unused config field
* Review 1
* Fix p1s
* Fix p2s and p3s that needed it
* Added a TODO
* Fix test, lower max cached graph, add TODO
* Fix fragmentation with big warmup
* Add more space for logits processors
* Fix

@IlyasMoutawwakil

* Allow for registered experts from kernels hub
* remove deepgemm as that is also dynamic
* Apply repo consistency fixes
* Update src/transformers/modeling_utils.py
* Update src/transformers/modeling_utils.py
* Apply repo consistency fixes
* Apply suggestion from @IlyasMoutawwakil
* Apply repo consistency fixes
* get rid of triton dependency
* keep eager first

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
Co-authored-by: IlyasMoutawwakil <moutawwakil.ilyas.tsi@gmail.com>

* docs
* feedback

update expectations for gemma3n

Summary:

1. fix torchao NVFP4 serialization with transformers
2. add a test to cover the fix

While I'm here, also did the following, bundled into this PR:

3. make the torchao serialization test have human-readable names (easier to debug)
4. fix the float8 test (update the expected output)

After this PR, the test command for all torchao configs passes on an NVIDIA B200.

Test Plan:

```
RUN_SLOW=1 pytest tests/quantization/torchao_integration/test_torchao.py -k "Serialization" -s
```

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* added sonic moe
* use lazy_load_kernel
* style
* use concatenated revision
* final touches
* fix
* merge conflict
* simpler naming
* style
* add sonicmoe test
* skip fp32 on sonic
* add transposed support
* fix

---------

Co-authored-by: vasqu <antonprogamer@gmail.com>

fix: continue when content is a string

* qa: bumped mlinter and allow local override
* bump version
* Update utils/check_modeling_rules_doc.py

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

* license header
* license header

---------

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

evalstate · 2026-04-27T12:00:11Z

Trace for this mergeability run: https://huggingface.co/datasets/evalstate/transformers-merge-experiments/blob/main/2604271257-X8iKhj__dev__codex.jsonl

harshaljanjani and others added 30 commits February 27, 2026 22:06

init: Add files (v1)

eaef822

fix: Fix ci/circleci: check_repository_consistency

ddc1bd7

feat: Add support and test harness for all variants

85c7356

fix: Fix ci/circleci: check_repository_consistency

adc4079

Merge branch 'main' into add-deimv2

81a3d06

refactor: Resolve review comments

39d300e

Merge branch 'main' into add-deimv2

476d69f

refactor: Resolve second review round

4ad0dc5

nit: Fix copyright year

16f2d07

Merge branch 'main' into add-deimv2

78eaf93

Merge branch 'main' into add-deimv2

dbe577b

Merge branch 'main' into add-deimv2

1259628

refactor: Resolve third review round

31ee908

revert: Adhere to the pattern from yonigozlan

4a3a877

Merge branch 'main' into add-deimv2

558c2af

nit: Clarify the docstring

ada78bf

refactor: Resolve fourth review round

496ce9c

Merge branch 'main' into add-deimv2

5a12a56

Merge branch 'main' into add-deimv2

85b4079

refactor: Closing in on the final set of nits

422a440

Merge branch 'main' into add-deimv2

f932158

fix: Resolve merge conflicts

b833ee3

fix: Add loss override and address nits

58a6424

nits: Fix minor issues

7dd0fb1

fixup their init weights

943f4bb

Merge branch 'main' into add-deimv2

6213518

[docs] multi-turn tool calling (huggingface#45554)

bd69ed2

* docs
* feedback

[AMD CI] Fix expectations for Gemma3n (huggingface#45602)

8e64e53

update expectations for gemma3n

vkuzo and others added 7 commits April 23, 2026 12:22

Processing Utils: continue when content is a string (huggingface#45605)

1e071b2

fix: continue when content is a string

fix: Fix loss coupling issue

fb1f387

Merge branch 'main' into add-deimv2

3629f13

Merge PR huggingface#44339: model: Add DEIMv2 to Transformers

c81eeec

evalstate wants to merge 37 commits into main from merge-cluster-cluster-41211-3-20260427115403

11 participants
