Fix re-compilations for cross attention cache #39788
zucchini-nlp merged 1 commit into huggingface:main
Conversation
[For maintainers] Suggested jobs to run (before merge) run-slow: autoformer, bert, bert_generation, big_bird, bigbird_pegasus, blip, bridgetower, camembert, data2vec, electra, ernie, fsmt, gpt_bigcode, imagegpt, kosmos2, led
manueldeprada left a comment
lgtm, sorry!! These changes got lost when cherry-picking back and forth between the `layer[i].keys` and `key_cache[i]` designs in the original PR 😭
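For context, here is a minimal sketch of the two access patterns being discussed, assuming a recent transformers version where `DynamicCache` stores per-layer objects and keeps `key_cache` only as a compatibility property (shapes and values are purely illustrative):

```python
import torch
from transformers import DynamicCache

cache = DynamicCache()
# Illustrative shapes: (batch, num_heads, seq_len, head_dim)
key = torch.randn(1, 4, 3, 8)
value = torch.randn(1, 4, 3, 8)
cache.update(key, value, layer_idx=0)

# Per-layer design: each layer object owns its own key/value tensors.
k_new = cache.layers[0].keys

# Legacy design: indexed list exposed as `key_cache`; accessing it goes through
# a compatibility property that emits a deprecation warning, which is what
# breaks fullgraph compilation when core model code still uses it.
k_legacy = cache.key_cache[0]
```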
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
No worries, that happens 😄 Let me see if I can add an encoder-decoder compile test easily in this PR or if we need to handle a lot of edge cases.
EDIT: oh, these aren't generative models / can't compile with fullgraph, and we don't have a graph-break test for those models yet. That's why it wasn't caught in CI.
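A rough sketch of what such a graph-break check could look like (not the actual CI test; the tiny checkpoint name is a placeholder, and `torch._dynamo.explain` is only used here to count breaks instead of asserting a full graph):

```python
import torch
from transformers import AutoModelForSeq2SeqLM

# Placeholder tiny checkpoint; any small encoder-decoder model would do.
model = AutoModelForSeq2SeqLM.from_pretrained("hf-internal-testing/tiny-random-t5")
model.eval()

input_ids = torch.tensor([[1, 2, 3, 4]])
decoder_input_ids = torch.tensor([[0]])

# These models can't be compiled with fullgraph=True end to end, so instead of
# requiring a single graph we count how many graph breaks the forward pass hits.
explanation = torch._dynamo.explain(model)(
    input_ids=input_ids, decoder_input_ids=decoder_input_ids
)
print(explanation.graph_break_count)
```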
fix recompilations for cross attn cache
What does this PR do?
Fixes #39774.
As per the title, when the legacy `cache.key_cache[layer_idx]` accessor is used, a warning is emitted and fullgraph compilation breaks. This PR makes sure no warnings are raised when using the models in the core library.
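A minimal sketch of the behaviour this is meant to guarantee, assuming the legacy accessor raises a standard Python deprecation warning (the tiny checkpoint name is a placeholder):

```python
import warnings
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

checkpoint = "hf-internal-testing/tiny-random-bart"  # placeholder tiny model
tok = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

inputs = tok("hello world", return_tensors="pt")

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    model.generate(**inputs, max_new_tokens=5)

# With this PR, core modeling code no longer goes through the legacy
# `key_cache`/`value_cache` accessors, so no such deprecation warning
# should be recorded during generation with a cross-attention cache.
assert not any("key_cache" in str(w.message) for w in caught)
```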