[transformers] set return dict false for transformers v5 compatibility#1325

Merged
erictang000 merged 4 commits into NovaSky-AI:main from erictang000:return_dict_false
Mar 20, 2026
Conversation

erictang000 (Collaborator) commented Mar 14, 2026

Overview

This PR makes two changes for transformers-v5 compatibility:

  • Sets return_dict=False where needed. This can be merged before explicitly upgrading to transformers v5 in pyproject.toml, since vllm still does not fully support v5 in its latest release. The change is also backwards compatible with transformers v4.*, since prior to v5 the default value for return_dict was None, which was interpreted as False.
  • Checks whether fsdp_transformer_layer_cls_to_wrap is a set for v5 compatibility, while maintaining backwards compatibility.

Breaking huggingface/transformers PR that changes the return_dict behavior for v5: huggingface/transformers#42567
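The two changes above can be sketched roughly as follows. This is a minimal illustration under assumptions, not the actual SkyRL code: the helper name `normalize_layer_cls_to_wrap` is hypothetical, and the model call is shown only in a comment since it requires a loaded model.

```python
# Minimal sketch of the two compatibility patterns (hypothetical helper name,
# not the actual SkyRL code).
#
# 1. Pass return_dict=False explicitly so the model returns a plain tuple on
#    both transformers v4 and v5, e.g.:
#
#       logits = model(input_ids, attention_mask=mask, return_dict=False)[0]
#
# 2. Accept both container types for fsdp_transformer_layer_cls_to_wrap:
#    transformers v5 may hand back a set where v4 used a list.
def normalize_layer_cls_to_wrap(layer_cls):
    """Return a list of layer-class names from either a set (v5) or a list (v4)."""
    if isinstance(layer_cls, set):
        # sets are unordered, so sort for a deterministic wrapping policy
        return sorted(layer_cls)
    return list(layer_cls)
```

Both `normalize_layer_cls_to_wrap({"LlamaDecoderLayer"})` and `normalize_layer_cls_to_wrap(["LlamaDecoderLayer"])` then yield `["LlamaDecoderLayer"]`.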

Validation

Checked that all CPU tests pass with both transformers < 5.0.0 and transformers==5.3.0, and that examples/train/gsm8k/run_gsm8k runs with both the old and new transformers.


# For a simple chat template, the fixed base approach is expected to behave the same
# as `apply_chat_template`. We compare decoded strings rather than raw token IDs
expected_token_ids = tokenizer_w_dummy_template.apply_chat_template(messages)
erictang000 (Collaborator, Author):

This comparison between the full chat template and stripping the base only happens in this test, not in the SkyRLGymGenerator.

Member:

Is it broken by v5? I wonder why it didn't surface until now.

erictang000 (Collaborator, Author):

Yeah, I asked Claude and it said the relevant PR was huggingface/transformers#40936, and that in that PR:

  • LlamaTokenizer was rewritten to inherit from TokenizersBackend (Rust) instead of the old Python SentencePiece backend
  • legacy was changed from defaulting to True to defaulting to False
  • The _get_prepend_scheme helper was added to select "first" vs "always" based on the legacy flag
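A toy illustration of why the test in this thread compares decoded strings rather than raw token IDs: different tokenizer backends can split the same text into different pieces, producing different IDs that nonetheless decode to identical strings. The vocabularies below are made up and do not correspond to any real tokenizer.

```python
# Toy example (made-up vocabularies, not real tokenizers): two backends split
# the same text differently, so token IDs differ while decoded strings match.
vocab_a = {1: "▁Hello", 2: "▁world"}            # backend A: coarse pieces
vocab_b = {10: "▁He", 11: "llo", 12: "▁world"}  # backend B: finer pieces

def decode(ids, vocab):
    # SentencePiece-style decoding: join pieces, turn "▁" markers into spaces
    return "".join(vocab[i] for i in ids).replace("▁", " ").strip()

ids_a = [1, 2]
ids_b = [10, 11, 12]
assert ids_a != ids_b                                     # raw IDs differ
assert decode(ids_a, vocab_a) == decode(ids_b, vocab_b)   # both "Hello world"
```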


CharlieFRuan (Member) left a comment:

Thank you!

@erictang000 erictang000 merged commit b2242a0 into NovaSky-AI:main Mar 20, 2026
5 of 6 checks passed
@erictang000 erictang000 deleted the return_dict_false branch March 20, 2026 23:38