Fix cache-related tests #39676
Conversation
run-slow: kyutai_speech_to_text, musicgen_melody, qwen2_5_omni, rag, roformer, superglue

This comment contains run-slow, running the specified jobs: models: ['models/kyutai_speech_to_text', 'models/musicgen_melody', 'models/qwen2_5_omni', 'models/rag', 'models/roformer', 'models/superglue']

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

run-slow: kyutai_speech_to_text, musicgen, musicgen_melody, rag, roformer, superglue

This comment contains run-slow, running the specified jobs: models: ['models/kyutai_speech_to_text', 'models/musicgen', 'models/musicgen_melody', 'models/rag', 'models/roformer', 'models/superglue']

[For maintainers] Suggested jobs to run (before merge) run-slow: kyutai_speech_to_text, llava_next, llava_next_video, musicgen, musicgen_melody, qwen2_5_omni, qwen2_5_vl, qwen2_vl, rag, roformer, superglue
ydshieh left a comment
Not very familiar with some of the changes in src/transformers, so I will rely on CI and trust you. Just a few nit questions to get some idea.
Thank you!
Could you link the PR you mentioned in this PR description?
do you know what causes these changes?
yep, after #39374 we started using a higher default max length. The text-only generation pipeline already uses it, so it's fine
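
For readers following along, a minimal sketch of pinning the generation length explicitly so a test doesn't depend on whatever default max length the library ships (the model name is only a placeholder, not the one used in these tests):

```python
from transformers import pipeline

# Any small causal LM works for illustration.
pipe = pipeline("text-generation", model="gpt2")

# An explicit max_new_tokens overrides the library's default max length,
# keeping the output length stable across library versions.
out = pipe("Hello, my name is", max_new_tokens=20)
print(out[0]["generated_text"])
```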
```python
logits_padded = res_padded.logits[inputs_dict["attention_mask"].bool()]
logits_padfree = res_padfree.logits[0]

torch.testing.assert_close(logits_padded.argmax(-1), logits_padfree.argmax(-1), rtol=0, atol=0)
```
ok, can't find it in the common test file, so fine.
this one doesn't make sense because we want to check logits, not sampled argmax tokens. Even a tiny diff in logits can give different argmax tokens, and the check with torch.allclose on the next line is enough
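
For context, a minimal sketch of the tolerance-based check being preferred here (the stand-in tensors and tolerances are illustrative, not the values used in the actual test):

```python
import torch

# Stand-in tensors; in the real test these are the padded and
# padding-free model outputs shown in the snippet above.
logits_padded = torch.randn(5, 32)
logits_padfree = logits_padded + 1e-6  # tiny numerical drift

# Compare raw logits within a tolerance rather than requiring exact
# argmax equality: a tiny difference can flip the argmax even when the
# two forward passes agree to numerical precision.
assert torch.allclose(logits_padded, logits_padfree, rtol=1e-4, atol=1e-4)
```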
```python
# Apply RoPE if self-attention
if not is_cross_attention and sinusoidal_pos is not None:
    if self.rotary_value:
        query_layer, key_layer, value_layer = self.apply_rotary_position_embeddings(
            sinusoidal_pos, query_layer, key_layer, value_layer
        )
    else:
        query_layer, key_layer = self.apply_rotary_position_embeddings(
            sinusoidal_pos, query_layer, key_layer
        )
```
was it deleted at some point, and here you're just adding it back?
yep, accidentally deleted RoPE 🙈
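
For readers unfamiliar with the block above: apply_rotary_position_embeddings rotates pairs of channels in the query/key (and, when rotary_value is set, value) projections by position-dependent angles. A minimal standalone sketch of that rotation, using the standard interleaved RoPE formulation (the library's exact tensor layout may differ):

```python
import torch

def apply_rope(x: torch.Tensor, sin: torch.Tensor, cos: torch.Tensor) -> torch.Tensor:
    """Rotate channel pairs of x by position-dependent angles.

    x:        [batch, heads, seq_len, head_dim]
    sin, cos: [seq_len, head_dim // 2], from a sinusoidal position table
    """
    x1, x2 = x[..., 0::2], x[..., 1::2]  # even/odd channel pairs
    # 2D rotation of each (x1, x2) pair by the per-position angle.
    rotated = torch.stack((x1 * cos - x2 * sin, x1 * sin + x2 * cos), dim=-1)
    return rotated.flatten(-2)  # interleave pairs back to [..., head_dim]

# Tiny smoke test with made-up shapes.
q = torch.randn(1, 2, 4, 8)
pos = torch.arange(4).unsqueeze(-1) / (10000 ** (torch.arange(4) / 4))
q_rot = apply_rope(q, pos.sin(), pos.cos())
assert q_rot.shape == q.shape
```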
* fix
* fix kyutai at last
* fix unrelated tests and copies
* update musicgen as well
* revert tensor
* fix old test failures
* why it wasn't added?
What does this PR do?
As per the title: these tests started failing after the latest cache compatibility PR.