[qwen-vl] Standardize config by zucchini-nlp · Pull Request #37268 · huggingface/transformers

zucchini-nlp · 2025-04-04T06:55:43Z

What does this PR do?

BC is kept and the models can be loaded as before. All attributes are still available though general config (config.vocab_size) but in model code we use config.text_config now

Separating out the text config from the general config allows us to support multimodality and text as separate entities, with their own base classes and configs. Related to #37033

github-actions · 2025-04-04T06:55:59Z

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

HuggingFaceDocBuilderDev · 2025-04-04T07:22:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker

Thanks for fixing! We tlaked about this a while ago!

* update * fix tests * fixup * update * skip this one * fixup * fix

…` attribute See huggingface#37268 for details about changes in Qwen2VL's config.

* update * fix tests * fixup * update * skip this one * fixup * fix

* feat: add colqwen2 (wip) * tests: fix test_attention_outputs * tests: reduce hidden size to accelerate tests * tests: fix `test_attention_outputs` 🥳 * fix: fix wrong parent class for `ColQwen2ForRetrievalOutput` * fix: minor typing and style changes * chore: run `make style` * feat: remove redundant `max_num_visual_tokens` attribute in `ColQwen2Processor` * tests: tweak comments * style: apply ruff formatter * feat: move default values for `visual_prompt_prefix` and `query_prefix` * docs: update ColQwen2 model card * docs: tweak model cards * docs: add required example config checkpoint * tests: update expected scores in integration test * docs: tweak quickstart snippets * fix: address PR comments * tests: fix colqwen2 tests + tweak comment in colpali test * tests: unskip useful tests * fix: fix bug when `visual_prompt_prefix` or `query_prefix` is an empty string * fix: fix ColPali outputs when `return_dict == False` * fix: fix issue with PaliGemma output not being a dict * docs: set default dtype to bfloat16 in quickstart snippets * fix: fix error when `return_dict=False` in ColPali and ColQwen2 * tests: fix special tokens not being replaced in input_ids * style: fix lint * fix: `ColQwen2Processor`'s `padding_side` is now set from `processor_config.json` * fix: remove unused `padding_side` in ColQwen2 model * docs: update ColQwen2's model doc * fix: fix harcoded vlm backbone class in ColQwen2Config * fix: remove `padding_side` from ColQwen2Processor as should fed from kwargs * docs: fix typo in model docstring * docs: add illuin mention in model docs * fix: let `padding_size` be handled by `tokenizer_config.json` * docs: add colpali reference url in colqwen2's model doc * docs: add Hf mention in model docs * docs: add late interaction mention in model docs * docs: tweak colqwen2 model doc * docs: update reference checkpoint for ColPali to v1.3 * docs: simplify quickstart snippets * docs: remove redundant `.eval()` * refactor: use `can_return_tuple` decorator for ColPali and ColQwen2 * docs: fix copyright date * docs: add missing copyright in tests * fix: raise error when `initializer_range` is not in config * docs: remove redundant `.eval()` in colpali doc * fix: fix `get_text_config` now that Qwen2VL has a proper `text_config` attribute See #37268 for details about changes in Qwen2VL's config. * fix: add missing `initializer_range` attribute in `ColQwen2Config` * fix: use `get_text_config` in `resize_token_embeddings` * update colwen2 with auto_docstring * docs: fix wrong copyright year * chore: remove `raise` as `initializer_range` has a default value in `ColQwen2Config` * refactor: merge `inner_forward` into `forward` * Refactor colqwen2 after refactoring of qwen2VL, use modular for modeling code * protect torch import in modular to protect in processing * protect torch import in modular to protect in processing * tests: fix hf model path in ColQwen2 integration test * docs: clarify `attn_implementation` and add comments * docs: add fallback snippet for using offline PIL dummy images * docs: temporarily revert attn_implementation to `None` while sdpa is not fixed * docs: tweaks in colpali/colqwen2 quick start snippets * fix: add missing flags to enable SDPA/Flex Attention in ColQwen2 model * fix: add missing changes in modular file * fix modeling tests --------- Co-authored-by: yonigozlan <yoni.gozlan@huggingface.co>

update

8a4242b

github-actions Bot marked this pull request as draft April 4, 2025 06:55

Merge branch 'main' into qwen-clean-configs

369712d

zucchini-nlp requested a review from ArthurZucker April 4, 2025 06:56

zucchini-nlp marked this pull request as ready for review April 4, 2025 06:56

zucchini-nlp added 3 commits April 4, 2025 14:29

fix tests

a0e2bcc

fixup

e302d2f

update

0cd30e0

ArthurZucker approved these changes Apr 8, 2025

View reviewed changes

zucchini-nlp added 11 commits April 14, 2025 11:26

merge main

c1c3a34

skip this one

bc14bc4

Merge branch 'main' into qwen-clean-configs

95a33ac

Merge branch 'main' into qwen-clean-configs

7525695

fixup

df1f984

fix

3d6cd85

Merge branch 'main' into qwen-clean-configs

da79b6e

Merge branch 'main' into qwen-clean-configs

bd66386

Merge branch 'main' into qwen-clean-configs

a0cff5b

Merge branch 'main' into qwen-clean-configs

df6fb35

Merge branch 'main' into qwen-clean-configs

b42c07a

zucchini-nlp merged commit 3bc44ea into huggingface:main Apr 17, 2025
20 checks passed

cyr0930 pushed a commit to cyr0930/transformers that referenced this pull request Apr 18, 2025

[qwen-vl] Standardize config (huggingface#37268)

38c8a16

* update * fix tests * fixup * update * skip this one * fixup * fix

zucchini-nlp mentioned this pull request Apr 25, 2025

Support multimodal models in vLLM with transformers backend #37780

Closed

7 tasks

tonywu71 mentioned this pull request Apr 29, 2025

Add ColQwen2 to 🤗 transformers #35778

Merged

14 tasks

tonywu71 added a commit to tonywu71/transformers that referenced this pull request Apr 29, 2025

fix: fix get_text_config now that Qwen2VL has a proper `text_config…

eaa797b

…` attribute See huggingface#37268 for details about changes in Qwen2VL's config.

zucchini-nlp added a commit to zucchini-nlp/transformers that referenced this pull request May 14, 2025

[qwen-vl] Standardize config (huggingface#37268)

a26e3e1

* update * fix tests * fixup * update * skip this one * fixup * fix

This was referenced May 22, 2025

CI failure due to transformers VLM change linkedin/Liger-Kernel#723

Closed

AttributeError: 'Qwen2VLConfig' object has no attribute 'hidden_size' #38331

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[qwen-vl] Standardize config#37268

[qwen-vl] Standardize config#37268
zucchini-nlp merged 16 commits intohuggingface:mainfrom
zucchini-nlp:qwen-clean-configs

zucchini-nlp commented Apr 4, 2025

Uh oh!

github-actions Bot commented Apr 4, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Apr 4, 2025

Uh oh!

ArthurZucker left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

zucchini-nlp commented Apr 4, 2025

What does this PR do?

Uh oh!

github-actions Bot commented Apr 4, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Apr 4, 2025

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants