
Fix train_batch_size and eval_batch_size to respect split_batches config#45694

Open
MinuriRajapakse wants to merge 1 commit into huggingface:main from MinuriRajapakse:main

Conversation

@MinuriRajapakse

Fixes #45693

Problem

When split_batches=True is set in accelerator_config, the
train_batch_size and eval_batch_size properties still multiplied
per_device_batch_size by n_gpu, which is incorrect.

When split_batches=True, the batch is split across devices rather
than replicated, so the total batch size equals per_device_batch_size
directly.
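The difference between the two semantics can be shown with a toy calculation (the numbers below are assumed for illustration, not taken from the PR):

```python
# Toy illustration of the two batching semantics; the values here are
# assumed for the example only.
per_device_batch_size = 8
n_gpu = 4

# split_batches=False: each device loads its own per-device-sized batch,
# so the effective total batch size is per-device * number of devices.
total_when_replicated = per_device_batch_size * n_gpu  # 32

# split_batches=True: one batch is split across the devices,
# so the effective total batch size is just the per-device value.
total_when_split = per_device_batch_size  # 8

print(total_when_replicated, total_when_split)
```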

Fix

Added a check for split_batches in both train_batch_size and
eval_batch_size properties in TrainingArguments.
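A minimal sketch of what such a check might look like. The class below is a simplified stand-in that mirrors the names in the PR description (per_device_train_batch_size, n_gpu, split_batches), not the actual transformers implementation:

```python
# Simplified stand-in for TrainingArguments, sketching the described fix;
# the real transformers class has many more fields and properties.
class TrainingArgumentsSketch:
    def __init__(self, per_device_train_batch_size, n_gpu, split_batches=False):
        self.per_device_train_batch_size = per_device_train_batch_size
        self.n_gpu = n_gpu
        self.split_batches = split_batches  # would come from accelerator_config

    @property
    def train_batch_size(self):
        # New check: when the batch is split across devices, the total
        # batch size is the per-device value itself.
        if self.split_batches:
            return self.per_device_train_batch_size
        # Previous behaviour, still correct for replicated batches.
        return self.per_device_train_batch_size * max(1, self.n_gpu)


args = TrainingArgumentsSketch(per_device_train_batch_size=8, n_gpu=4,
                               split_batches=True)
print(args.train_batch_size)  # 8 rather than 32
```

The eval_batch_size property would get the same guard, using per_device_eval_batch_size.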

Testing

  • Added a new test, test_batch_size_respects_split_batches
  • All 26 existing tests plus the new one pass

@Rocketknight1
Member

cc @SunMarc



Development

Successfully merging this pull request may close issue #45693: "Why the calculation of train_batch_size unrelated to split_batches"
