🚨 Delete generation params from model config #41695

zucchini-nlp · 2025-10-17T15:24:16Z

What does this PR do?

Disentangles the model config from the generation config. The model config now raises an error if it finds generation params. To keep BC, we still load models from the Hub whose config contains generation params, but we manually drop those params instead of setting them as attributes.

… happens
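The behaviour described above can be sketched roughly as follows. This is a toy illustration only, not the transformers implementation: `ToyModelConfig`, `from_hub_dict`, and the contents of `GENERATION_KEYS` are hypothetical names chosen for the example.

```python
# Hypothetical sketch; names and the key set are assumptions, not library code.
GENERATION_KEYS = frozenset(
    {"max_new_tokens", "do_sample", "temperature", "top_p", "num_beams"}
)


class ToyModelConfig:
    """Toy stand-in for a model config that refuses generation params."""

    def __init__(self, **kwargs):
        # Strict path: a user passing generation params directly gets an error.
        offending = sorted(GENERATION_KEYS.intersection(kwargs))
        if offending:
            raise ValueError(f"Generation params found in model config: {offending}")
        for key, value in kwargs.items():
            setattr(self, key, value)

    @classmethod
    def from_hub_dict(cls, config_dict):
        # BC path: silently drop generation params found in an old config file.
        cleaned = {k: v for k, v in config_dict.items() if k not in GENERATION_KEYS}
        return cls(**cleaned)


config = ToyModelConfig.from_hub_dict({"hidden_size": 8, "temperature": 0.7})
print(hasattr(config, "temperature"))  # the dropped key is never set as an attribute
```

The two entry points are kept separate here so that only the loading path is lenient, while direct construction stays strict.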

HuggingFaceDocBuilderDev · 2025-10-17T15:33:43Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp · 2025-10-21T13:35:10Z

docs/source/en/model_doc/qwen2_audio.md

 url = "https://qianwen-res.oss-cn-beijing.aliyuncs.com/Qwen-Audio/glass-breaking-151256.mp3"
 audio, sr = librosa.load(BytesIO(urlopen(url).read()), sr=processor.feature_extractor.sampling_rate)
-inputs = processor(text=prompt, audios=audio, return_tensors="pt").to(model.device)
+inputs = processor(text=prompt, audio=audio, return_tensors="pt").to(model.device)


The audio renaming is unrelated; it accidentally slipped in from another branch. I'll rebase on main after #41681 is merged.

zucchini-nlp · 2025-11-17T15:17:13Z

cc @vasqu, since Joao left and I need someone to help merge generation PRs. This one in particular is blocking me. I'd love to clean up and break BC for v5, since the generation config is too complicated at this point.

This PR splits the generation config away from the model config, a dependency we have had for ages for legacy reasons. I think it's time to move on and raise a ValueError if users manually add generation params to a model config.

NOTE: badly formatted configs from the Hub can still be loaded for BC.
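For users who currently keep generation params in a model config, one possible migration step is to split the legacy dict before constructing anything. The sketch below is an assumption-laden illustration: `split_legacy_config` and the `GENERATION_KEYS` set are hypothetical, and the real list of generation params is much longer.

```python
# Assumed, non-exhaustive set of generation keys, for illustration only.
GENERATION_KEYS = frozenset(
    {"max_new_tokens", "do_sample", "temperature", "top_k", "top_p", "num_beams"}
)


def split_legacy_config(config_dict):
    """Split a legacy config dict into (model_kwargs, generation_kwargs)."""
    model_kwargs, generation_kwargs = {}, {}
    for key, value in config_dict.items():
        # Route each key to the dict it belongs in.
        target = generation_kwargs if key in GENERATION_KEYS else model_kwargs
        target[key] = value
    return model_kwargs, generation_kwargs


legacy = {"hidden_size": 768, "num_beams": 4, "do_sample": False}
model_kwargs, generation_kwargs = split_legacy_config(legacy)
print(model_kwargs)       # keys that stay in the model config
print(generation_kwargs)  # keys that belong in a generation config
```

The second dict could then be handed to a generation config object (for example `GenerationConfig(**generation_kwargs)` in transformers) instead of the model config.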

vasqu

I'm generally aligned with this; I just have a few nits/questions to be sure.

Don't forget:

- the bad audio(s)
- the v5 tag, and add it to the issue

src/transformers/generation/configuration_utils.py

src/transformers/generation/utils.py

src/transformers/modeling_utils.py

src/transformers/models/bart/configuration_bart.py

tests/utils/test_modeling_utils.py

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

vasqu

Thx, lgtm

Time to merge with main and check for conflicts 👀 Same things as I said before re: audios + v5.

github-actions · 2025-11-17T17:26:46Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: bart, encoder_decoder, mvp, rag, speech_encoder_decoder, udop, vision_encoder_decoder

zucchini-nlp · 2025-11-18T10:20:20Z

Rebased on main; the diff now has no audio-related changes. Added the v5 tag and a rotating light (🚨) to the title, since it's a bit breaking. Merging.

i am so confused, too many circular dependencies. Delete and see what…

4a47d6c

… happens

zucchini-nlp mentioned this pull request Oct 17, 2025

Untangle config inheritance #41541

Open

zucchini-nlp added 9 commits October 17, 2025 18:00

pop if exists

34a5909

fix a few tests

b98e099

fix loading generation params from model config

43bbdc2

oh no, revert this

213ea36

replace audios with audio in docs

410eaa7

fix tests

5947542

Merge branch 'main' into generation-params-in-config

69e4363

fix last test

0d0e48e

i am dumb, typo

6d2a297

zucchini-nlp requested a review from ArthurZucker October 21, 2025 13:30

zucchini-nlp commented Oct 21, 2025

zucchini-nlp changed the title ~~Delete generation params from model config~~ 🚨 Delete generation params from model config Nov 17, 2025

zucchini-nlp requested a review from vasqu November 17, 2025 15:17

vasqu reviewed Nov 17, 2025

zucchini-nlp and others added 2 commits November 17, 2025 17:18

Update src/transformers/generation/configuration_utils.py

ca84022

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

Update tests/utils/test_modeling_utils.py

ae70fa9

Co-authored-by: Anton Vlasjuk <73884904+vasqu@users.noreply.github.com>

zucchini-nlp added the for_v5? label Nov 17, 2025

vasqu approved these changes Nov 17, 2025

Merge branch 'main' into generation-params-in-config

1b79c23

zucchini-nlp merged commit b1bdf9c into huggingface:main Nov 18, 2025
23 checks passed

albertvillanova mentioned this pull request Nov 28, 2025

CI fails with dev dependencies: AttributeError: 'GPTNeoXForSequenceClassification' object has no attribute 'generation_config' huggingface/trl#4595

Closed
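The downstream error above suggests that some code accessed `generation_config` on a model that does not have one. A defensive access pattern might look like the sketch below. It is only an illustration under that assumption: the toy classes are hypothetical stand-ins, and this is not necessarily the fix adopted in trl.

```python
class ToyClassifier:
    """Stand-in for a non-generative model with no generation_config."""


class ToyGenerator:
    """Stand-in for a generative model that carries a generation_config."""

    def __init__(self):
        self.generation_config = {"max_new_tokens": 16}


def get_generation_config(model):
    # Return None instead of raising AttributeError when the attribute is absent.
    return getattr(model, "generation_config", None)


for model in (ToyClassifier(), ToyGenerator()):
    config = get_generation_config(model)
    if config is None:
        print(f"{type(model).__name__}: no generation config, skipping")
    else:
        print(f"{type(model).__name__}: {config}")
```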

3 participants
