🗜 Hotfix: avoid passing quantization_config=None #4019

Merged: qgallouedec merged 6 commits into main from fix-passing-model-kwargs on Sep 9, 2025
Conversation

@qgallouedec (Member)

Passing

model = AutoModelForCausalLM.from_pretrained("my_model", quantization_config=None)

isn't the same as

model = AutoModelForCausalLM.from_pretrained("my_model")

which causes GPT-OSS models to fail to load when used with the TRL CLI.
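
For context, here is a minimal sketch of the guard this hotfix applies (the model_init_kwargs dict and its contents are illustrative, not the exact diff): None-valued kwargs are dropped before being forwarded to from_pretrained, so an explicit quantization_config=None behaves like an omitted argument.

from transformers import AutoModelForCausalLM

# Illustrative sketch, not the exact diff: filter out None-valued entries so
# quantization_config=None is never forwarded to from_pretrained.
model_init_kwargs = {"torch_dtype": "auto", "quantization_config": None}
model_init_kwargs = {k: v for k, v in model_init_kwargs.items() if v is not None}

model = AutoModelForCausalLM.from_pretrained("my_model", **model_init_kwargs)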

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@albertvillanova (Member) left a comment

Thanks for the PR: definitely, GPT OSS models should be supported in TRL!!!

That said, I'm not fully convinced we need to apply the "filter out all None kwargs" policy across the board. Most parameters in from_pretrained are robust to None: have you seen any issue with any parameter other than quantization_config?

In my view, there are two complementary points here:

  1. Upstream issue in Transformers: do you know why transformers treats a missing quantization_config differently from an explicit None? I think this might be a bug. In my opinion, passing None should be treated the same as omitting the key altogether, not as an invalid quantization config. It would be worth raising or linking an issue upstream so we can align behavior there.
    • I can have a look at this.
  2. Targeted fix in TRL: While investigating and waiting for an upstream fix, it makes sense for TRL to guard specifically against this problem.

What about stripping only quantization_config (and the associated device_map) when it is None? This would keep the fix minimal and avoid unnecessarily rewriting the kwargs for unrelated arguments. What do you think?
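
A sketch of that targeted variant (strip_null_quantization is a hypothetical helper name, not the merged code):

def strip_null_quantization(model_init_kwargs: dict) -> dict:
    # Hypothetical helper sketching the targeted guard suggested above:
    # remove quantization_config, plus the device_map that accompanies it,
    # only when quantization_config is explicitly None.
    kwargs = dict(model_init_kwargs)
    if kwargs.get("quantization_config") is None:
        kwargs.pop("quantization_config", None)
        kwargs.pop("device_map", None)
    return kwargs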

(Inline review threads on trl/trainer/gkd_trainer.py and trl/trainer/online_dpo_trainer.py.)
@qgallouedec (Member, Author)

Yeah, good point.

> have you seen any issue with any parameter other than quantization_config?

No.

> do you know why transformers treats a missing quantization_config differently from an explicit None?

I think it comes from these lines; probably a bug, but I haven't had time to look into it further.

https://github.com/huggingface/transformers/blob/37c14430c99edca79dfcdcb76f1209f291b12fab/src/transformers/configuration_utils.py#L962-L967
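
To illustrate the suspected failure mode (this is a hypothetical sketch, not the actual transformers code at those lines):

# Hypothetical sketch of the suspected failure mode, not the actual
# transformers code: a key-membership check treats an explicit None
# differently from an omitted key.
kwargs = {"quantization_config": None}

if "quantization_config" in kwargs:   # True even though the value is None
    quantization_config = kwargs["quantization_config"]
    # downstream code then tries to interpret None as a quantization config
    print(type(quantization_config))  # <class 'NoneType'>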

> Targeted fix in TRL: While investigating and waiting for an upstream fix, it makes sense for TRL to guard specifically against this problem.

Yep, agree, done.


I will merge this PR now, as I'd like to include it in the release, but we should definitely follow up on this:

> It would be worth raising or linking an issue upstream so we can align behavior there.

qgallouedec changed the title from "Fix passing model kwargs" to "🗜 Hotfix: avoid passing quantization_config=None" on Sep 9, 2025
qgallouedec merged commit a647e5a into main on Sep 9, 2025 (9 of 11 checks passed)
qgallouedec deleted the fix-passing-model-kwargs branch on September 9, 2025 at 20:50