Remove redundant alignment of pad_token_id #5487

albertvillanova merged 1 commit into huggingface:main
Conversation
Cursor Bugbot has reviewed your changes and found 1 potential issue.
| "in the vocabulary before using it as a padding token." | ||
| ) | ||
| processing_class.pad_token = pad_token | ||
| model.config.pad_token_id = processing_class.pad_token_id |
Removal relies on feature absent in minimum transformers version

Medium Severity

The removed `model.config.pad_token_id = processing_class.pad_token_id` assignment relies on `Trainer._align_special_tokens()` to handle alignment automatically, but this mechanism only exists in transformers v5.5.0+ (main branch). The project's minimum supported version is `transformers>=4.56.2`, where no such alignment occurs. For models that ship with `config.pad_token_id = None` (e.g., Llama, Mistral), sequence-classification heads use `config.pad_token_id` during forward-pass pooling; without alignment, this can cause a `ValueError` for batch sizes > 1 or incorrect pooling on older supported versions.
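For context, a minimal sketch of the failure mode described above and the manual workaround this PR removes. The checkpoint name is illustrative, and the sketch assumes a transformers version without automatic special-token alignment:

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Illustrative checkpoint; Llama-style configs ship with pad_token_id = None.
model_id = "meta-llama/Llama-3.2-1B"
model = AutoModelForSequenceClassification.from_pretrained(model_id, num_labels=1)
tokenizer = AutoTokenizer.from_pretrained(model_id)
tokenizer.pad_token = tokenizer.eos_token  # tokenizer-side padding is now defined

# On versions without automatic alignment, model.config.pad_token_id is still
# None here, and a batched forward pass raises:
#   ValueError: Cannot handle batch sizes > 1 if no padding token is defined.
# The manual alignment this PR removes avoided that:
model.config.pad_token_id = tokenizer.pad_token_id
```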
I have checked the transformers code:

- `_align_special_tokens` was implemented in v4.56.0
- it was renamed to `align_special_tokens` in v5.2.0
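For anyone pinning an older transformers release, a hedged sketch of a version guard based on the findings above (the helper is illustrative, not part of this PR):

```python
from packaging import version
import transformers

def maybe_align_pad_token(model, tokenizer):
    """Manual fallback for transformers releases predating automatic
    special-token alignment (added as _align_special_tokens in v4.56.0,
    renamed align_special_tokens in v5.2.0)."""
    if version.parse(transformers.__version__) < version.parse("4.56.0"):
        model.config.pad_token_id = tokenizer.pad_token_id
```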
can you quickly check when `_align_special_tokens` was introduced?
Done @qgallouedec. See: #5487 (comment)
Remove redundant alignment of `pad_token_id`.

Note that this alignment is already done by `transformers.Trainer.train()` → `self.align_special_tokens()` / `self._align_special_tokens()`.

This PR removes the redundant alignment of `pad_token_id` between the tokenizer and the model config, relying instead on `transformers.Trainer.align_special_tokens()`, which is called during training.
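Roughly, the upstream alignment this PR now relies on boils down to something like the following sketch (illustrative only; the actual transformers implementation covers more special tokens and edge cases):

```python
def align_special_tokens_sketch(model, processing_class):
    # Copy the tokenizer's pad token id onto the model config if they differ,
    # so pooling and loss masking see a consistent padding id.
    pad_token_id = getattr(processing_class, "pad_token_id", None)
    if pad_token_id is not None and model.config.pad_token_id != pad_token_id:
        model.config.pad_token_id = pad_token_id
```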
Changes

Padding token management cleanup:

- Removed `model.config.pad_token_id = tokenizer.pad_token_id` in `examples/scripts/prm.py`.
- Removed `model.config.pad_token_id = processing_class.pad_token_id` in the `RewardTrainer` initialization.

Note
Low Risk

Low-risk cleanup that removes duplicate `pad_token_id` assignments and relies on Transformers' built-in special-token alignment during training; behavior could change only if code paths depend on the earlier manual override before `Trainer.train()` runs.
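As a hypothetical illustration of that caveat (the surrounding setup is assumed, not taken from this repo), code that reads the config between trainer construction and training would now observe the unaligned value:

```python
trainer = RewardTrainer(
    model=model,
    args=training_args,
    processing_class=tokenizer,
    train_dataset=dataset,
)

# Before this PR, pad_token_id was already copied onto the model config at
# this point. After this PR, a pre-training read may still see None:
print(model.config.pad_token_id)

trainer.train()  # alignment now happens inside train()
```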
Overview

Removes manual alignment of `model.config.pad_token_id` with the tokenizer/processing class in both `examples/scripts/prm.py` and `RewardTrainer`, relying on Transformers' `Trainer` special-token alignment instead. This reduces redundant configuration mutation and keeps padding-token handling centralized in the upstream training flow.