[V5] Return a BatchEncoding dict from apply_chat_template by default again #42567
Rocketknight1 merged 11 commits into main
Conversation
[For maintainers] Suggested jobs to run (before merge): run-slow: blenderbot, bloom, cohere, gpt2, gpt_sw3

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
cc @LysandreJik - this was one of the V5 PRs before, do I need to do anything special with this one, or can we just merge it to |
zucchini-nlp left a comment:
great, it was already approved once so lgtm 😄
```python
if not tokenize:
    return_dict = False  # dicts are only returned by the tokenizer anyway
```
makes me wonder, do we need to support the combination of tokenize=True, return_dict=False, or can we deprecate/remove return_dict over time? I can't think of cases where users want a bare list of tokens as output
Maybe we can get rid of it over time, but I think it's fine as a backward compatibility flag for now!
sure, i meant after v5 + several more minor releases, and if users are fine with it
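The `tokenize`/`return_dict` combinations discussed in this thread can be sketched in plain Python. This is a simplified stand-in, not the actual transformers implementation: the chat rendering, the toy tokenizer, and the function name are all illustrative assumptions; only the return-type matrix mirrors the behavior described above.

```python
# Simplified sketch of the return-type dispatch behind apply_chat_template.
# NOT the real transformers code; it only models the combinations:
#   tokenize=False                     -> plain string
#   tokenize=True, return_dict=True    -> dict shaped like a BatchEncoding
#   tokenize=True, return_dict=False   -> bare list of token ids

def apply_chat_template_sketch(messages, tokenize=True, return_dict=True):
    text = "".join(f"<|{m['role']}|>{m['content']}" for m in messages)
    if not tokenize:
        # dicts are only returned by the tokenizer, so return_dict is moot here
        return text
    # Toy "tokenizer": one arbitrary id per whitespace-separated piece.
    input_ids = [hash(tok) % 1000 for tok in text.split()]
    if return_dict:
        return {"input_ids": input_ids, "attention_mask": [1] * len(input_ids)}
    return input_ids

msgs = [{"role": "user", "content": "hi there"}]
print(type(apply_chat_template_sketch(msgs, tokenize=False)).__name__)
print(sorted(apply_chat_template_sketch(msgs).keys()))
print(type(apply_chat_template_sketch(msgs, return_dict=False)).__name__)
```

Under this sketch, flipping the default `return_dict` to `True` only changes the middle row of the matrix, which is why `tokenize=False` callers are unaffected by the PR.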
…again (huggingface#42567)

* Flip the default return type for `apply_chat_template` to match the underlying tokenizer
* Remove test_tokenization_for_chat tests, which no longer do anything useful
* Remove test_tokenization_for_chat tests, which no longer do anything useful
* Fix test_encode_message tests
* Fix test_encode_message tests
* nit fix
* Trigger tests
* Remove test_tokenization_for_chat
* make fixup
* Add a little test to make sure that doesn't happen again
* make fixup
#1325)

# Overview

This PR makes two changes for transformers-v5 compatibility:

- Sets `return_dict=False` where needed. This can be merged prior to explicitly upgrading to transformers v5 in the pyproject.toml, since vLLM still does not fully support v5 in its latest release. The change is backwards compatible with transformers v4.*, where the default value for `return_dict` was `None`, which was interpreted as `False`.
- Checks whether `fsdp_transformer_layer_cls_to_wrap` is a set, for v5 compatibility while maintaining backwards compatibility.

Breaking huggingface/transformers `return_dict=False` PR for v5: huggingface/transformers#42567

## Validation

Checked that all CPU tests pass both with transformers < 5.0.0 and with transformers==5.3.0, and that `examples/train/gsm8k/run_gsm8k` runs with both old and new transformers.
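The second change above (handling `fsdp_transformer_layer_cls_to_wrap` arriving as a set) can be sketched as a small normalizer. This is a hypothetical helper, not the actual vLLM/SkyRL code; the function name and the comma-separated-string case are illustrative assumptions.

```python
# Hypothetical normalizer: transformers v5 may store
# fsdp_transformer_layer_cls_to_wrap as a set, while earlier versions
# used a list (or sometimes a single comma-separated string). Downstream
# code that expects a stable, ordered list can normalize like this.

def normalize_wrap_classes(value):
    if value is None:
        return []
    if isinstance(value, str):
        # e.g. "LlamaDecoderLayer,MistralDecoderLayer"
        return [cls.strip() for cls in value.split(",") if cls.strip()]
    # Covers both list (v4) and set (v5); sort for deterministic order.
    return sorted(value)

print(normalize_wrap_classes({"LlamaDecoderLayer"}))
print(normalize_wrap_classes("LlamaDecoderLayer, MistralDecoderLayer"))
```

Sorting a set before use matters here because set iteration order is not stable across processes, which would otherwise make FSDP wrapping configuration nondeterministic.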
This is basically PR #41626 again! Some of it got clobbered in the tokenizer refactor, but it's just as good the second time.