[Auto] Fix xdist captured_info collisions (cluster-45561-3): merged 1 of 2 PRs #38

Open
evalstate wants to merge 23 commits into main from merge-cluster-cluster-45561-3-20260427115403

Conversation

@evalstate
Owner

Cluster: cluster-45561-3
Base: origin/main
Branch: merge-cluster-cluster-45561-3-20260427115403

Merged PRs:

Skipped PRs:

Failed PRs:

  • None.

Notes:

Next steps:

remi-or and others added 23 commits April 23, 2026 11:34
* Fix KV dedup for decode batches

* Fix memory estimation

* Change default

* Added write-only fast path

* Take both peaks into account

* Revert unused config field

* Review 1

* Fix p1s

* Fix p2s and p3s that needed it

* Added a TODO

* Fix test, lower max cached graph, add TODO

* Fix fragmentation with big warmup

* Add more space for logits processors

* Fix

* Allow for registered experts from kernels hub

* remove deepgemm as that is also dynamic

* Apply repo consistency fixes

* Update src/transformers/modeling_utils.py

* Update src/transformers/modeling_utils.py

* Apply repo consistency fixes

* Apply suggestion from @IlyasMoutawwakil

* Apply repo consistency fixes

* get rid of triton dependency

* keep eager first

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>
Co-authored-by: IlyasMoutawwakil <moutawwakil.ilyas.tsi@gmail.com>

Summary:

1. fix torchao NVFP4 serialization with transformers
2. add a test to cover the fix

While I'm here, I also bundled the following into this PR:
3. make the torchao serialization tests use human-readable names (easier
   to debug)
4. fix the float8 test (update the expected output)

After this PR, the test command for all torchao configs passes on an
NVIDIA B200.

Test Plan:

```
RUN_SLOW=1 pytest tests/quantization/torchao_integration/test_torchao.py -k "Serialization" -s
```
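Point 3 above (human-readable test names) can be sketched roughly as follows. This is an illustration only: the config class names are hypothetical stand-ins, and in a real test the generated ids would be passed to `@pytest.mark.parametrize(..., ids=...)`:

```python
# Sketch only: building human-readable pytest ids from quantization configs,
# so failures read like "test_serialization[NVFP4Config]" instead of
# "test_serialization[config7]". Class names are illustrative stand-ins.
class Float8WeightOnlyConfig:
    pass

class NVFP4Config:
    pass

def readable_id(config) -> str:
    """Use the config's class name as the test id."""
    return type(config).__name__

configs = [Float8WeightOnlyConfig(), NVFP4Config()]
ids = [readable_id(c) for c in configs]
# These strings would become the parametrized test ids.
```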

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* added sonic moe

* use lazy_load_kernel

* style

* use concatenated revision

* final touches

* fix

* merge conflict

* simpler naming

* style

* add sonicmoe test

* skip fp32 on sonic

* add transposed support

* fix

---------

Co-authored-by: vasqu <antonprogamer@gmail.com>

* qa: bumped mlinter and allow local override

* bump version

* Update utils/check_modeling_rules_doc.py

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

* license header

* license header

---------

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

…#45610)

* Fix missing conversion of experts

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

* Fix eager config attribute reading

Co-authored-by: Copilot <copilot@github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

* Add proper error when kernels isn't installed

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

* remove unnecessary mapping

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

* review comments

Co-authored-by: Copilot <copilot@github.com>
Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

* remove double newline

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>

---------

Signed-off-by: Harry Mellor <19981378+hmellor@users.noreply.github.com>
Co-authored-by: Copilot <copilot@github.com>

…uggingface#45601)

* fix: compute auxiliary losses when denoising is disabled in D-FINE

* style: fix formatting

* test: add regression test for auxiliary losses when denoising is disabled

* test: fix num_labels config in auxiliary loss regression test

---------

Co-authored-by: Yoni Gozlan <74535834+yonigozlan@users.noreply.github.com>

* remove warnings

* fix

* revert

* revert useless

* move function outside

…ing path (huggingface#45582)

* generate: drop stale num_return_sequences warning on continuous batching path

The continuous-batching branch warned that num_return_sequences was
unsupported alongside num_beams, but generate_batch() already honors
generation_config.num_return_sequences when expanding requests.  The
warning fires for any run that explicitly sets num_return_sequences
even though the feature works, cluttering logs and misleading users.

Drop the num_return_sequences half of the warning; keep the num_beams
guard since beam search is still unsupported on the CB path.

Fixes huggingface#45563
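A minimal sketch of why the warning was stale: continuous batching can honor num_return_sequences simply by expanding each request N times before scheduling. The names below are illustrative, not the actual generate_batch() internals:

```python
from dataclasses import dataclass, replace

@dataclass
class Request:  # illustrative stand-in for a continuous-batching request
    request_id: str
    prompt: str

def expand_requests(requests, num_return_sequences):
    """Duplicate each request so N independent sequences are generated per
    prompt -- a simplified view of the expansion the CB path performs."""
    return [
        replace(req, request_id=f"{req.request_id}-{i}")
        for req in requests
        for i in range(num_return_sequences)
    ]

expanded = expand_requests([Request("r0", "hello")], 3)
```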

* Apply repo consistency fixes

---------

Co-authored-by: Joaquin Hui Gomez <joaquinhuigomez@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Rémi Ouazan <83456801+remi-or@users.noreply.github.com>

* chore(qa): split pipeline and add type checking

* added serving to quality

* fmt

* allow

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

* circleci with torch 2.11

* circleci with torch 2.11

* circleci with torch 2.11

* circleci with torch 2.11

* circleci with torch 2.11

* circleci with torch 2.11

---------

Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>

…th `num_labels=1` (huggingface#45611)

* Raise clear error for problem_type="single_label_classification" with num_labels=1

This combination is mathematically degenerate: applying cross-entropy loss to a
single logit always yields zero loss, so training silently accomplishes nothing.
Validate the combination in PreTrainedConfig.__post_init__ so users get a clear
error at config construction with a pointer to the correct setup (num_labels=2
for binary classification, or problem_type="regression" for a single-output
regression head).

Closes huggingface#45479
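The degeneracy described above is easy to verify numerically: with a single class, softmax always yields probability 1, so cross-entropy is exactly zero regardless of the logit value. A self-contained check (not the transformers code path):

```python
import math

def cross_entropy_single_logit(logit: float) -> float:
    """Cross-entropy over a 1-class output: softmax of a single logit is
    always [1.0], so the loss is -log(1.0) == 0 for every logit value."""
    prob = math.exp(logit) / math.exp(logit)  # one-element softmax == 1.0
    return -math.log(prob)

# Zero loss no matter what the model predicts -- training learns nothing.
losses = [cross_entropy_single_logit(x) for x in (-5.0, 0.0, 3.2)]
```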

* Update src/transformers/configuration_utils.py

* Update tests/utils/test_configuration_utils.py

* Update src/transformers/configuration_utils.py

---------

Co-authored-by: Matt <Rocketknight1@users.noreply.github.com>

…uggingface#45625)

Add supports_gradient_checkpointing to NemotronHPreTrainedModel

* Add output language to chunks

* Add output language to chunks

* Fix formatting

* Return full language instead of iso code

* revert changes (except test)

* correct fix

* fix

* values for runner

---------

Co-authored-by: Eustache Le Bihan <eulebihan@gmail.com>
Co-authored-by: eustlb <94853470+eustlb@users.noreply.github.com>