Multiple fixes to FA tests in AMD by remi-or · Pull Request #40498 · huggingface/transformers

remi-or · 2025-08-27T15:04:58Z

This PR fixes tests across a few models in the following ways:

solves a multi-device issue in qwen2_5_omni
adds AMD expectations to gemma3, qwen2_5_omni and qwen2_5_vl
fixes an issue in the qwen2_5_omni and qwen2_5_vl FA tests: the test changes hidden_size, which breaks compatibility with mrope. To fix this, I used try and except to ensure the change to mrope_section does not propagate to the rest of the tests. I am not sure this is the best way, so tagging @ydshieh
removes some mutable objects that were used as default arguments
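
The mrope_section point above can be illustrated with a small sketch. This is not the code from the PR: the helper name `run_with_temporary_mrope_section`, the `SimpleNamespace` stand-in for the config, and the section values are all hypothetical, and it uses try/finally where the description mentions try and except. It only shows the general idea of restoring a shared setting even when the test body fails.

```python
from types import SimpleNamespace


def run_with_temporary_mrope_section(config, new_section, test_body):
    """Run test_body(config) with a temporary mrope_section, then restore it.

    Hypothetical helper for illustration; not part of transformers.
    """
    original = list(config.rope_scaling["mrope_section"])
    try:
        config.rope_scaling["mrope_section"] = new_section
        return test_body(config)
    finally:
        # Runs whether test_body returned normally or raised,
        # so later tests never see the modified value.
        config.rope_scaling["mrope_section"] = original


# Hypothetical stand-in for a config object shared between tests.
# The section values are made up for the example.
config = SimpleNamespace(rope_scaling={"mrope_section": [2, 1, 1]})


def failing_body(cfg):
    raise AssertionError("simulated test failure")


# A passing body sees the temporary value.
seen = run_with_temporary_mrope_section(
    config, [4, 2, 2], lambda cfg: list(cfg.rope_scaling["mrope_section"])
)

# A failing body still leaves the original value in place afterwards.
try:
    run_with_temporary_mrope_section(config, [4, 2, 2], failing_body)
except AssertionError:
    pass

restored = config.rope_scaling["mrope_section"]
```

Whether a helper like this or an inline try block reads better in a test file is a matter of taste; the point is only that the restore step runs on every exit path.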

This PR touches mostly test-related files, so I think it's ok to bundle everything, but please let me know if not. As for the non-test-related changes, they are mostly multimodal, so cc @zucchini-nlp

HuggingFaceDocBuilderDev · 2025-08-27T15:14:24Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

src/transformers/models/qwen2_5_omni/modeling_qwen2_5_omni.py

tests/models/qwen2_5_omni/test_modeling_qwen2_5_omni.py

zucchini-nlp · 2025-08-28T09:02:20Z

tests/models/qwen2_5_vl/test_modeling_qwen2_5_vl.py

+        # Default vision config is None to avoid a mutable default argument
+        if vision_config is None:
+            vision_config = {
+                "depth": 2,


should not be a problem as long as the QwenConfig takes the dict and creates a new VisionConfig from it, but agreed that mutable defaults is not a good idea in general

Yeah it's kind of a nit, I thought it was the problem initially and it was not, but since it's fixed might as well leave it!

Let's avoid any mutable type in the arguments 🙏 so keep as it is in this PR
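
For readers unfamiliar with the pitfall discussed in this thread, here is a minimal sketch of why a mutable default argument can leak state between calls, and how the `None` default shown in the diff avoids it. The function names and the `depth` increment are hypothetical and only serve the illustration; as noted above, this was not the actual cause of the test failures.

```python
def make_config_shared(vision_config={"depth": 2}):
    # The default dict is created once, at function definition time,
    # so every call without an argument mutates the same object.
    vision_config["depth"] += 1
    return vision_config


def make_config_safe(vision_config=None):
    # A fresh dict is built on every call that omits the argument.
    if vision_config is None:
        vision_config = {"depth": 2}
    vision_config["depth"] += 1
    return vision_config


shared_first = make_config_shared()["depth"]
shared_second = make_config_shared()["depth"]  # differs from shared_first

safe_first = make_config_safe()["depth"]
safe_second = make_config_safe()["depth"]  # same as safe_first
```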

tests/models/qwen2_5_omni/test_modeling_qwen2_5_omni.py

ydshieh · 2025-08-28T14:49:32Z

Although you are an AMD fan, don't forget to use run-slow (unless you feel strongly that it's not necessary in some cases)

remi-or · 2025-09-01T09:47:27Z

tests/models/qwen2_5_vl/test_modeling_qwen2_5_vl.py


-        EXPECTED_DECODED_TEXT = [
-            'system\nYou are a helpful assistant.\nuser\nWhat kind of dog is this?\nassistant\nThe dog in the picture appears to be a Labrador Retriever. Labradors are known for their friendly and energetic nature, which is evident in',
-            "system\nYou are a helpful assistant.\nuser\nWho are you?\nassistant\n�\n\n addCriterion\nI'm sorry, but I don't understand your question. Could you please provide more context or clarify what you're asking",


@zucchini-nlp the second answer seems wrong to me: it used to be "I am Qwen, a large language model created by Alibaba Cloud. I am designed to answer a wide range of questions and provide information on various topics", which makes more sense. I added a #FIXME because it seems out of scope for this PR

yeah, this is quite similar to the generation from the non-FA2 model a few lines above, though I have no idea when it went from "I am Qwen" to "addCriterion"

Thanks, will need to go back in git history to see what changed

github-actions · 2025-09-01T11:16:44Z

This comment contains run-slow, running the specified jobs:

models: ['models/gemma3', 'models/qwen2_5_omni', 'models/qwen2_5_vl']
quantizations: [] ...

zucchini-nlp

approving in advance, so it can be merged as long as the slow tests show ✅

remi-or · 2025-09-01T15:12:22Z

run-slow: gemma3, qwen2_5_omni, qwen2_5_vl

github-actions · 2025-09-01T15:13:48Z

This comment contains run-slow, running the specified jobs:

models: ['models/gemma3', 'models/qwen2_5_omni', 'models/qwen2_5_vl']
quantizations: [] ...

remi-or · 2025-09-01T18:20:29Z

run-slow: gemma3, qwen2_5_omni, qwen2_5_vl

github-actions · 2025-09-01T18:20:39Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: gemma3, qwen2_5_omni, qwen2_5_vl

github-actions · 2025-09-01T18:21:56Z

This comment contains run-slow, running the specified jobs:

models: ['models/gemma3', 'models/qwen2_5_omni', 'models/qwen2_5_vl']
quantizations: [] ...

remi-or · 2025-09-01T18:47:42Z

There are slow tests failing for

gemma3, which are related to "Support batch size > 1 image-text inference" #36682 (I think), so not this PR
qwen2_5_vl, which has been addressed in the comments above and is unrelated to this PR
So merging this and addressing them down the line

remi-or requested a review from zucchini-nlp August 27, 2025 15:04

remi-or force-pushed the sweep-amd-ci-fa branch from 31df0bd to 3b2d07a Compare August 27, 2025 15:34

zucchini-nlp reviewed Aug 28, 2025

ydshieh reviewed Aug 28, 2025

zucchini-nlp mentioned this pull request Aug 28, 2025

[Qwen Omni/VL] Fix fa tests #40528

Merged

remi-or force-pushed the sweep-amd-ci-fa branch 2 times, most recently from ee12748 to 5f91da4 Compare September 1, 2025 09:43

remi-or commented Sep 1, 2025

zucchini-nlp approved these changes Sep 1, 2025

remi-or force-pushed the sweep-amd-ci-fa branch from 58b11bc to e27a65d Compare September 1, 2025 15:10

remi-or added 11 commits September 1, 2025 17:10

Expectations for gemma3

617560e

Fixes for Qwen2_5_VL tests

b0b652e

Added expectation but underlying pb is still there

14c2daf

Better handling of mrope section for Qwen2_5_vl

0dee25a

Fixes for FA2 tests and reformat batch test for Qwen2_5_Omni

77838cb

Fix multi-device error in qwen2_5_omni

b5031e3

Styel and repo-consistency

7a67548

Removed inherited test because fix in common

7f0bdce

slow tests fixes

f8cf1d2

Style

a0c2aef

Fixes for qwen2_5_vl or omni for FA test

27801c5

remi-or force-pushed the sweep-amd-ci-fa branch from e27a65d to 27801c5 Compare September 1, 2025 18:19

remi-or merged commit 514b3e8 into huggingface:main Sep 1, 2025
24 of 25 checks passed
