
Ensure final evaluation runs with step-based evaluation strategy#44146

Merged
SunMarc merged 24 commits intohuggingface:mainfrom
khushali9:add-eval-on-end
Mar 26, 2026

Conversation

@khushali9
Contributor

@khushali9 khushali9 commented Feb 19, 2026

What does this PR do?

When using a step-based evaluation strategy (IntervalStrategy.STEPS), the trainer may skip evaluation at the final step if the last step does not align with eval_steps.

The fix forces an evaluation at the final training step, so the final evaluation is no longer missed, while also preventing a duplicate evaluation when the last step already aligns with eval_steps.

In short:
We now guarantee a final evaluation for step-based strategies without double-evaluating.

Fixes #43935
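The scheduling change described above can be illustrated with a minimal sketch. This is not the actual transformers implementation: `should_evaluate` here is a standalone model of the decision made after each optimizer step, and the `force_final_eval` flag stands in for the end-of-training check this PR adds.

```python
# Sketch of step-based evaluation scheduling (an illustration, not the
# trainer's real code). An evaluation fires on every eval_steps interval;
# the fix additionally fires one at the last step, unless an interval
# evaluation already lands there (which would double-evaluate).

def should_evaluate(step: int, max_steps: int, eval_steps: int,
                    force_final_eval: bool = True) -> bool:
    """Return True if an evaluation should run after `step`."""
    if step % eval_steps == 0:
        return True  # regular interval evaluation
    return force_final_eval and step == max_steps

# Before the fix: max_steps=10, eval_steps=3 evaluates at steps 3, 6, 9 only.
old = [s for s in range(1, 11) if should_evaluate(s, 10, 3, force_final_eval=False)]
new = [s for s in range(1, 11) if should_evaluate(s, 10, 3)]
print(old)  # [3, 6, 9]      -> final step 10 is skipped
print(new)  # [3, 6, 9, 10]  -> final evaluation guaranteed

# When max_steps is a multiple of eval_steps there is no duplicate:
aligned = [s for s in range(1, 10) if should_evaluate(s, 9, 3)]
print(aligned)  # [3, 6, 9]
```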

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a Github issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.



Comment thread src/transformers/trainer.py Outdated
@Rocketknight1
Member

cc @SunMarc

Member

@SunMarc SunMarc left a comment


Hey, as I commented, I think the easier solution would be to update the callback to set `should_evaluate` to True at the end of training if the user chooses the step strategy

@khushali9
Contributor Author

Hey, as I commented, I think the easier solution would be to update the callback to set `should_evaluate` to True at the end of training if the user chooses the step strategy

I'm not able to see your comment, but I think I understood; let me update the PR.
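The suggested approach can be sketched as a user-side callback. Everything here is hypothetical: the class name `EvalOnEndCallback` is invented for illustration, and the import guard exists only so the logic runs without transformers installed.

```python
# Hypothetical sketch of the suggestion above: a callback that sets
# `should_evaluate` at the last training step, skipping the case where an
# interval evaluation already fires there.
try:
    from transformers import TrainerCallback
except ImportError:  # minimal stand-in so the sketch runs standalone
    class TrainerCallback:
        pass

class EvalOnEndCallback(TrainerCallback):
    def on_step_end(self, args, state, control, **kwargs):
        at_last_step = state.global_step >= state.max_steps
        already_due = state.global_step % args.eval_steps == 0
        if at_last_step and not already_due:
            control.should_evaluate = True
        return control
```

A user on an older transformers version could pass an instance of such a callback via `Trainer(..., callbacks=[EvalOnEndCallback()])` to get the same behavior.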

@khushali9 khushali9 requested a review from SunMarc February 20, 2026 18:31
Comment thread src/transformers/trainer_callback.py Outdated
Comment thread src/transformers/trainer_callback.py Outdated
@khushali9
Contributor Author

@SunMarc what do you think ?

Member

@SunMarc SunMarc left a comment


Thanks! Just left a couple of questions!

Comment thread src/transformers/trainer_callback.py Outdated
Comment thread src/transformers/trainer_callback.py Outdated
Comment thread src/transformers/trainer_callback.py Outdated
Comment thread src/transformers/trainer_callback.py Outdated
@khushali9
Contributor Author

Thanks! Just left a couple of questions!

Agreed with your asks. There were specific CI test failures related to some of them, which is why I had to add this many changes; otherwise, my first commit after your comment was just a simple `if` setting `should_evaluate=True`.

@khushali9
Contributor Author

@SunMarc I have updated the PR. Earlier I added a condition to avoid test changes, since the behavior changed for some tests; now I have kept the check simple and updated those tests instead. Thanks

@khushali9 khushali9 requested a review from SunMarc February 25, 2026 00:40
@khushali9
Contributor Author

@SunMarc Any update on this? I have addressed all your queries.

Member

@SunMarc SunMarc left a comment


Thanks, please update the description of the PR. cc @qgallouedec @winglian wdyt?

@SunMarc SunMarc requested review from qgallouedec and winglian March 2, 2026 17:17
@khushali9 khushali9 changed the title added eval_on_end to trainer Ensure final evaluation runs with step-based evaluation strategy Mar 2, 2026
Member

@qgallouedec qgallouedec left a comment


Devin Review found 3 potential issues.


Comment thread src/transformers/trainer_callback.py Outdated
Comment thread tests/trainer/test_trainer_callback.py
Comment thread tests/trainer/test_trainer.py Outdated
@khushali9
Contributor Author

@qgallouedec I addressed the delay issue with a test update; can you review it again? Thanks
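The "delay issue" mentioned above presumably concerns `eval_delay`, the TrainingArguments option that postpones the first evaluation. As a simplified model (an assumption, not the trainer's exact code), no evaluation should fire before the delay threshold, including the forced final one:

```python
# Simplified model of how eval_delay might interact with the guaranteed
# final evaluation: evaluations are suppressed until the delay threshold,
# after which the interval rule and the final-step rule both apply.

def should_evaluate(step: int, max_steps: int, eval_steps: int,
                    eval_delay: int = 0) -> bool:
    if step < eval_delay:
        return False  # still inside the delay window
    return step % eval_steps == 0 or step == max_steps

# eval_delay=8 suppresses the step-3 and step-6 evaluations:
fired = [s for s in range(1, 11) if should_evaluate(s, 10, 3, eval_delay=8)]
print(fired)  # [9, 10]
```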

@khushali9 khushali9 requested a review from qgallouedec March 2, 2026 23:52
@khushali9
Contributor Author

@SunMarc how can I take this forward to merge? I took care of all of @qgallouedec's feedback.

Comment thread tests/trainer/test_trainer_callback.py Outdated
@khushali9 khushali9 requested a review from qgallouedec March 4, 2026 16:19
@khushali9
Contributor Author

@qgallouedec how can I take this forward? I already made the changes per your feedback.

@khushali9
Contributor Author

@SunMarc @qgallouedec how can I take this forward to merge? Thanks

@khushali9
Contributor Author

@Rocketknight1 any help taking this forward? It is approved, and I took care of all the feedback.

@khushali9
Contributor Author

khushali9 commented Mar 16, 2026

@Rocketknight1 @qgallouedec @SunMarc Any help getting this merged? I took care of all the feedback.

@khushali9
Contributor Author

@Rocketknight1 @SunMarc the failing test does not seem to be related to my change

@khushali9
Contributor Author

@SunMarc @Rocketknight1 Any update on this issue? Thanks

@khushali9
Contributor Author

@SunMarc @qgallouedec can you please help take this forward to merge? Thanks

@SunMarc SunMarc enabled auto-merge March 26, 2026 14:48
@SunMarc
Member

SunMarc commented Mar 26, 2026

Merged!

@SunMarc SunMarc added this pull request to the merge queue Mar 26, 2026
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@github-merge-queue github-merge-queue Bot removed this pull request from the merge queue due to failed status checks Mar 26, 2026
@khushali9
Contributor Author

@SunMarc thanks for trying to merge. I think it needed a branch update with main, and the CI test failures were not related to my change, so if you merge now it should succeed. Thanks

@SunMarc SunMarc enabled auto-merge March 26, 2026 16:08
@SunMarc SunMarc added this pull request to the merge queue Mar 26, 2026
Merged via the queue into huggingface:main with commit ec19614 Mar 26, 2026
29 checks passed
zucchini-nlp pushed a commit to zucchini-nlp/transformers that referenced this pull request Mar 27, 2026
…gingface#44146)

* rebase

* merge conflict

* merge conflict1

* merge conflict trainer

* blank space qulity run

* lint error

* modify test to address our change

* rebase

* rebase

* rebase

* rebase

* test updated with delay check

* checkpoint tests updated

* test updated in utils

* correct test condition

* style format
NielsRogge pushed a commit to NielsRogge/transformers that referenced this pull request Mar 30, 2026
…gingface#44146)



Successfully merging this pull request may close these issues.

Add eval_on_end flag (analogous to eval_on_start)

5 participants