Test: generate with `torch.compile(model.forward)` as a fast test by gante · Pull Request #34544 · huggingface/transformers

gante · 2024-10-31T18:25:48Z

What does this PR do?

Follow-up to #34464

This PR:

1. Converts `test_generate_compile_model_forward` to a fast test. This means we will check `generate` with `torch.compile(model.forward)` at each commit on ALL models that support `StaticCache` 💛
2. Fixes failing cases of `test_generate_compile_model_forward` whenever possible
3. Tags models with `_supports_static_cache = False  # Reason` when the model doesn't support `torch.compile(model.forward)`

✅ `py.test tests/models/ -k test_generate_compile` is all green and takes ~2 mins to run on all models on my machine
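
To illustrate the tagging item in the description: a shared test can read a class-level flag and skip the models that opt out. This is a minimal sketch in plain Python, not the transformers test code. `TinyDecoder`, `TinyLegacyModel`, and `split_by_static_cache_support` are hypothetical names, the reason given in the comment is invented for illustration, and treating a missing flag as "unsupported" is an assumption. Only the `_supports_static_cache` attribute name comes from this PR.

```python
class TinyDecoder:
    """Hypothetical model class that works with a static cache."""

    _supports_static_cache = True


class TinyLegacyModel:
    """Hypothetical model class that cannot be compiled."""

    # Hypothetical reason, written next to the flag as the PR describes.
    _supports_static_cache = False  # data-dependent control flow in forward


def split_by_static_cache_support(model_classes):
    """Partition model class names into (tested, skipped) using the flag."""
    tested, skipped = [], []
    for model_class in model_classes:
        # Assumption: a class without the flag is treated as unsupported.
        if getattr(model_class, "_supports_static_cache", False):
            tested.append(model_class.__name__)
        else:
            skipped.append(model_class.__name__)
    return tested, skipped


tested, skipped = split_by_static_cache_support([TinyDecoder, TinyLegacyModel])
print("compile test runs on:", tested)
print("compile test skips:", skipped)
```

Keeping the flag on the class, with the reason beside it, means the shared test needs no per-model special cases.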

ydshieh

Love this!

Q: Is it really fast ...?

Remark: I feel `get_max_cache_length` is a better name than `get_max_cache_shape`, but OK, I know it's not great to change names all the time.

gante · 2024-10-31T18:41:25Z

> Q: Is it really fast ...?

@ydshieh yes :D

HuggingFaceDocBuilderDev · 2024-10-31T18:53:05Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker

I don't mind, tho I don't think our priority should be this (full compile vs compile forward in generate!) + I don't see the test being run in the CI! 🤗

ArthurZucker · 2024-11-05T10:03:52Z

Could you just make sure it's run

ydshieh2 · 2024-11-05T11:16:43Z

We need to remove `@require_torch_gpu` too for `def test_generate_compile`
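
Why the decorator matters for a fast test: a GPU-gating decorator reports the test as skipped on a CPU-only runner, so the test body never executes there. The sketch below mimics this with the standard library's `unittest.skipUnless`. `GPU_AVAILABLE`, `require_gpu_sketch`, and `CompileTestsSketch` are hypothetical stand-ins, not the real `require_torch_gpu` implementation.

```python
import unittest

# Assumption: simulate a CPU-only CI runner.
GPU_AVAILABLE = False


def require_gpu_sketch(test_func):
    """Hypothetical GPU gate: skip the test unless a GPU is available."""
    return unittest.skipUnless(GPU_AVAILABLE, "test requires a GPU")(test_func)


class CompileTestsSketch(unittest.TestCase):
    @require_gpu_sketch
    def test_gated(self):
        # Never executed on the simulated CPU-only runner.
        self.assertTrue(True)

    def test_ungated(self):
        # Runs everywhere, which is what a fast per-commit test needs.
        self.assertTrue(True)


def run_and_list_skipped():
    """Run the sketch test case and return the names of skipped tests."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(CompileTestsSketch)
    result = unittest.TestResult()
    suite.run(result)
    # result.skipped holds (test, reason) pairs.
    return [test.id().split(".")[-1] for test, _reason in result.skipped]


skipped_names = run_and_list_skipped()
print("skipped on a CPU-only runner:", skipped_names)
```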

ydshieh · 2025-01-23T13:29:46Z

Before merging, feel free to ping me to check for flakiness (if there is any) :-) or anything else you think I can double-check.

ArthurZucker

Thanks, you can ignore my comments and merge 🤗

ArthurZucker · 2025-01-27T16:43:38Z

Is it possible for `HybridCache` to inherit from `StaticCache`?

We might just need an extra class that says `CompileCompatible`; someone wanted an `is_static` attr!
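
One way to picture this suggestion is a marker mixin that caches opt into, so a hybrid cache can be flagged as compile-friendly without inheriting from the static cache. This is only a sketch of the idea. Every class below is a hypothetical stand-in (hence the `Sketch` suffixes), and nothing here reflects how the transformers cache classes are actually defined.

```python
class CompileCompatible:
    """Hypothetical marker mixin for caches with fixed, compile-friendly shapes."""

    is_static = True


class CacheSketch:
    """Hypothetical base cache; not compile-compatible by default."""

    is_static = False


class StaticCacheSketch(CompileCompatible, CacheSketch):
    """Stand-in for a pre-allocated, fixed-size cache."""


class HybridCacheSketch(CompileCompatible, CacheSketch):
    """Stand-in for a hybrid cache: marked compatible, no static-cache parent."""


class DynamicCacheSketch(CacheSketch):
    """Stand-in for a cache that grows with each generated token."""


def can_compile_with(cache):
    """Check the marker instead of testing for one specific cache class."""
    return isinstance(cache, CompileCompatible)


for cache in (StaticCacheSketch(), HybridCacheSketch(), DynamicCacheSketch()):
    print(type(cache).__name__, can_compile_with(cache), cache.is_static)
```

The mixin is listed first in the bases so that its `is_static = True` takes precedence over the base cache's default in the method resolution order.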

gante · 2025-01-27T18:04:16Z

(sorry, the PR is not ready yet, a few cases are still failing 👀 I didn't mean to request a review)

gante · 2025-01-28T12:49:58Z

Now it's working on all models, including encoder-decoder + cache 🤗

It's not too heavy on our CI: it should add ~2 mins if all models are run. And it should save us many headaches! As we can see in the diff, we had compilation enabled for a bunch of models that don't support it.

fix tests

c9e3ed6

gante requested review from ArthurZucker and ydshieh October 31, 2024 18:25

gante commented Oct 31, 2024

Comment thread tests/models/chameleon/test_modeling_chameleon.py Outdated

ydshieh reviewed Oct 31, 2024

Comment thread tests/generation/test_utils.py Outdated

ydshieh approved these changes Oct 31, 2024

ArthurZucker reviewed Nov 5, 2024

Merge branch 'main' into generate_forward_compile_fix

04d5adf

gante commented Jan 22, 2025

Comment thread tests/generation/test_utils.py Outdated

gante and others added 4 commits January 22, 2025 11:42

Update tests/generation/test_utils.py

d789a57

make fixup

43f96af

tmp commit

3f165c8

rely on auto compilation for the tests

524b3cb

ArthurZucker approved these changes Jan 27, 2025

gante and others added 5 commits January 27, 2025 19:32

fix a few more cases (a few to go)

ab51b67

all working :D

7777cf1

Merge branch 'main' into generate_forward_compile_fix

16804ea

make fixup

0753963

add compile cache reset

4430684

gante and others added 2 commits January 28, 2025 13:50

allow compilation on cpu

bc4e8b3

Merge branch 'main' into generate_forward_compile_fix

f22e86b

gante merged commit ece8c42 into huggingface:main Jan 28, 2025

gante deleted the generate_forward_compile_fix branch January 28, 2025 14:10

This was referenced Jan 31, 2025

[Moshi] disable automatic compilation #35986

Closed

[generation] automatic compilation fixes: don't log when output_attentions is True #35989

Closed

[Moshi] disable automatic compilation if the model can't compile #35992

Merged

bursteratom pushed a commit to bursteratom/transformers that referenced this pull request Jan 31, 2025

Test: generate with torch.compile(model.forward) as a fast test (hu…

b264bcf

…ggingface#34544)

elvircrn pushed a commit to elvircrn/transformers that referenced this pull request Feb 13, 2025

Test: generate with torch.compile(model.forward) as a fast test (hu…

7e50e54

…ggingface#34544)

sbucaille pushed a commit to sbucaille/transformers that referenced this pull request Feb 16, 2025

Test: generate with torch.compile(model.forward) as a fast test (hu…

8cb5e65

…ggingface#34544)

5 participants
