Token by ArkVex · Pull Request #40857 · huggingface/transformers

ArkVex · 2025-09-12T19:10:26Z

What does this PR do?

This PR fixes the calculation of train_tokens_per_second when resuming training from a checkpoint. Previously, the metric was calculated using global state, which could result in unrealistically high values after resuming. Now, the timer and token counters are reset when resuming, so the metric reflects only the current training session.

Fixes #40560

Before submitting

This PR addresses a bug in the Trainer metrics.
Discussed in issue train_tokens_per_second is wrong after continuing from checkpoint #40560.
No new dependencies.
No documentation changes required.
No new tests added, but existing metric logic is covered.

Who can review?

@zach-huggingface @ArthurZucker ,

…oken

ArkVex · 2025-09-12T19:12:36Z

Hey @ArthurZucker i have another pr open what shall i do?
If you merge this both will get merged right?

ArthurZucker

hey, sorry I don't understandyour question as I only see this PR linked to this issue

ArkVex added 5 commits August 31, 2025 03:30

megatron_bert model card update

656bd00

Merge branch 'huggingface:main' into main

f5d3f1e

Merge branch 'huggingface:main' into main

7d26880

reset timer and token counters for train_tokens_per_second

8003790

Merge branch 'token' of https://github.com/ArkVex/transformers into t…

28f64d2

…oken

ArthurZucker reviewed Sep 15, 2025

View reviewed changes

Comment thread docs/source/en/model_doc/megatron_bert.md

This was referenced Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#43

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Token#40857

Token#40857
ArkVex wants to merge 5 commits intohuggingface:mainfrom
ArkVex:token

ArkVex commented Sep 12, 2025

Uh oh!

ArkVex commented Sep 12, 2025

Uh oh!

ArthurZucker left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ArkVex commented Sep 12, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

ArkVex commented Sep 12, 2025

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants