remove vision2seq vs image-text-to-text by Vaibhavs10 · Pull Request #40695 · huggingface/transformers

Vaibhavs10 · 2025-09-04T14:30:15Z

reference internal chat: https://huggingface.slack.com/archives/C070Q9GGGGY/p1756985150983899

Qwen models are tagged as both vision2seq as well as image-text-to-text, removing the former since it's deprecated.

ref: https://huggingface.slack.com/archives/C070Q9GGGGY/p1756985150983899

github-actions · 2025-09-04T14:31:51Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

HuggingFaceDocBuilderDev · 2025-09-04T14:39:53Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

zucchini-nlp

Hm, I think this is quite breaking. The class is deprecated, yet some users can be still using it because we raise warning until v5

Isn't the hub snippet issue fixed in hub side?

pcuenca · 2025-09-08T17:53:46Z

Oh yes, this is breaking.

Additional context: https://huggingface.co/datasets/huggingface/transformers-metadata/discussions/7

Other alternatives:

Create a workaround around here.
Leave it unchanged until v5. As far as I know, generate and the snippet would still work even if the model is instantiated with AutoModelForVision2Seq.

remove vision2seq vs image-text-to-text

61e65b5

ref: https://huggingface.slack.com/archives/C070Q9GGGGY/p1756985150983899

Vaibhavs10 requested review from LysandreJik, merveenoyan and zucchini-nlp September 4, 2025 14:30

zucchini-nlp reviewed Sep 4, 2025

View reviewed changes

This was referenced Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#43

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

remove vision2seq vs image-text-to-text#40695

remove vision2seq vs image-text-to-text#40695
Vaibhavs10 wants to merge 1 commit intomainfrom
correct-mapping-vision2seq-->-image-text-to-text

Vaibhavs10 commented Sep 4, 2025

Uh oh!

github-actions Bot commented Sep 4, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Sep 4, 2025

Uh oh!

zucchini-nlp left a comment •

edited

Loading

Uh oh!

pcuenca commented Sep 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

Vaibhavs10 commented Sep 4, 2025

Uh oh!

github-actions Bot commented Sep 4, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Sep 4, 2025

Uh oh!

zucchini-nlp left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pcuenca commented Sep 8, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

zucchini-nlp left a comment •

edited

Loading