Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
Thanks, I'll review asap
cc @winglian |
@bot /style |
Style fix bot fixed some files and pushed the changes. |
…ormers into minor-changes-trainer
ArthurZucker
left a comment
I am down to keep aliases for now + deprecate (but it's up to you, if you can add lots of comments about it)
Trying to refrain from too many breaks from v5 already
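The alias-plus-deprecate approach suggested above could be sketched roughly as follows. This is an illustrative stub only: the body of `update_fsdp_plugin_peft` and the warning wording are assumptions for the sketch, not the actual transformers code.

```python
import warnings


# Stub standing in for the standalone function this PR introduces in
# `integrations.fsdp`; the real logic would update the FSDP plugin for
# QLoRA/PEFT. Here it just returns the plugin unchanged.
def update_fsdp_plugin_peft(fsdp_plugin, model):
    return fsdp_plugin


class Trainer:
    # One way to avoid a hard break: keep the old method as a thin,
    # deprecated alias that delegates to the standalone function.
    def _fsdp_qlora_plugin_updates(self, fsdp_plugin, model):
        warnings.warn(
            "`Trainer._fsdp_qlora_plugin_updates` is deprecated; use "
            "`update_fsdp_plugin_peft` from `integrations.fsdp` instead.",
            FutureWarning,
        )
        return update_fsdp_plugin_peft(fsdp_plugin, model)
```

Existing subclasses and callers keep working through the alias, while the `FutureWarning` gives a deprecation cycle before removal.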
Can you run the TRL CI on this PR @qgallouedec? Also @winglian, is there a way to run the axolotl CI just like we do for TRL? One of my next steps would also be to create nice tests like TRL's, so that we don't need to rely on TRL to catch regressions.

I think for these, it shouldn't be too breaking. There are other more sensitive args that will indeed require a deprecation cycle if we want to ensure a smooth transition in a future PR. If any of those are too breaking, I'm happy to put them back and do a proper deprecation cycle.
/trl-ci |
This resulted in a regression in Sentence Transformers due to the removal of
What does this PR do?
This PR updates a few minor things in the `Trainer`. Some of them are breaking, but I think it should be safe as I don't think anyone is subclassing these methods or using them separately.
Breaking:
- `propagate_args_to_deepspeed` to standalone function in `integrations.deepspeed`
- `_fsdp_qlora_plugin_updates` to standalone function in `integrations.fsdp` + rename to `update_fsdp_plugin_peft`
- `is_attention_mask_causal` to standalone function in `trainer_pt_utils.py`
- `_nested_gather` to standalone function in `trainer_pt_utils.py`
- `_add_sm_patterns_to_gitignore` -> not used at all
- `_align_special_tokens` to standalone function in `trainer_pt_utils.py`
- `deepspeed_sp_compute_loss` to `integrations.deepspeed`
- `_save_tpu` to `save_tpu_checkpoint` in `integrations.tpu` file
- `wrap_model_xla_fsdp` to move logic in `integrations.fsdp` file

Not breaking:
- `get_fsdp_ckpt_kwargs` to `integrations.fsdp`
- `safe_globals` to `trainer_utils.py`
- `_get_learning_rate` kept in `Trainer` as this method is actually used and not only in examples.
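As a rough illustration of the method-to-standalone-function moves listed above: the real `is_attention_mask_causal` in `trainer_pt_utils.py` operates on torch tensors, and its actual logic is not reproduced here; this plain-list stand-in with a simplified causality check (an assumption for the sketch) just shows the pattern, where everything the old `Trainer` method needed is now passed explicitly.

```python
def is_attention_mask_causal(mask):
    """True if a square 0/1 mask is exactly lower-triangular (causal).

    Toy stand-in: takes a list of lists instead of a torch tensor, and
    lives at module level rather than on `Trainer`, so any caller can
    use it without a trainer instance.
    """
    n = len(mask)
    return all(
        mask[i][j] == (1 if j <= i else 0)
        for i in range(n)
        for j in range(n)
    )


# Callers that previously went through a Trainer method would now call
# the module-level function directly on the mask they already have.
causal = [[1, 0], [1, 1]]          # lower-triangular: causal
bidirectional = [[1, 1], [1, 1]]   # full attention: not causal
```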