Skip to content

remove vision2seq vs image-text-to-text#40695

Open
Vaibhavs10 wants to merge 1 commit intomainfrom
correct-mapping-vision2seq-->-image-text-to-text
Open

remove vision2seq vs image-text-to-text#40695
Vaibhavs10 wants to merge 1 commit intomainfrom
correct-mapping-vision2seq-->-image-text-to-text

Conversation

@Vaibhavs10
Copy link
Copy Markdown
Contributor

reference internal chat: https://huggingface.slack.com/archives/C070Q9GGGGY/p1756985150983899

Qwen models are tagged as both vision2seq as well as image-text-to-text, removing the former since it's deprecated.

@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented Sep 4, 2025

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

@HuggingFaceDocBuilderDev
Copy link
Copy Markdown

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Copy link
Copy Markdown
Member

@zucchini-nlp zucchini-nlp left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hm, I think this is quite breaking. The class is deprecated, yet some users can be still using it because we raise warning until v5

Isn't the hub snippet issue fixed in hub side?

@pcuenca
Copy link
Copy Markdown
Member

pcuenca commented Sep 8, 2025

Oh yes, this is breaking.

Additional context: https://huggingface.co/datasets/huggingface/transformers-metadata/discussions/7

Other alternatives:

  • Create a workaround around here.
  • Leave it unchanged until v5. As far as I know, generate and the snippet would still work even if the model is instantiated with AutoModelForVision2Seq.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants