[bugfix] Update modeling_llama.py so it skips keys correctly#36289

Closed
HDCharles wants to merge 0 commits into huggingface:main from HDCharles:main

Conversation

HDCharles (Contributor) commented Feb 19, 2025

The Llama model was using past_key_value and past_key_values interchangeably, which caused issues because only one of them was actually listed in _skip_keys_device_placement when both needed to be skipped.

This PR changes the model so that only past_key_values is used.

Without this fix, Llama + torch.compile was failing, as in the issue linked below. Note that although the problem surfaced through a TorchAO recipe, the bug is unrelated to torchao.

I think #35763 might have introduced this bug, but I'm not sure.

Fixes pytorch/ao#1705
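
For context, a minimal sketch of the mismatch being fixed (simplified for illustration, not the actual modeling_llama.py source; _skip_keys_device_placement may be a string or a list depending on the transformers version):

```python
# Simplified sketch of the naming mismatch, not the actual transformers code.

class LlamaPreTrainedModel:
    # Kwarg names that accelerate's device-placement hooks should NOT move.
    # Only the plural form is listed here.
    _skip_keys_device_placement = ["past_key_values"]


class LlamaDecoderLayer:
    def forward(self, hidden_states, past_key_value=None, **kwargs):
        # Before this PR, the cache was passed around under the singular
        # name "past_key_value", which is missing from the skip list above,
        # so the device-placement hooks tried to move the cache object
        # along with the other kwargs.
        ...


class LlamaDecoderLayerAfterFix:
    def forward(self, hidden_states, past_key_values=None, **kwargs):
        # Using the plural name everywhere makes the kwarg match the skip
        # list, so the cache is left alone.
        ...
```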

jerryzh168 (Contributor)

cc @SunMarc, can you take a look?

HDCharles (Contributor, Author)

Are these test failures actually blocking? It seems like other models just have the same issue.

SunMarc (Member) commented Feb 20, 2025

This PR that was merged today actually fixes the issue you are having (Cache is not an nn.Module anymore, so it will be skipped automatically).
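
To illustrate why the skip becomes automatic once Cache stops subclassing nn.Module (a simplified sketch with a hypothetical move_kwargs_to_device helper, not accelerate's actual implementation): device-placement hooks only relocate objects they know how to move, so a plain Python cache object passes through untouched.

```python
import torch
import torch.nn as nn


def move_kwargs_to_device(obj, device):
    # Hypothetical helper standing in for what a device-placement hook does
    # to forward kwargs: move what it recognizes, leave everything else.
    if isinstance(obj, (torch.Tensor, nn.Module)):
        return obj.to(device)
    if isinstance(obj, (list, tuple)):
        return type(obj)(move_kwargs_to_device(x, device) for x in obj)
    if isinstance(obj, dict):
        return {k: move_kwargs_to_device(v, device) for k, v in obj.items()}
    # A Cache that is no longer an nn.Module falls through here untouched,
    # so it no longer needs an explicit entry in _skip_keys_device_placement.
    return obj
```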

SunMarc (Member) commented Feb 20, 2025

Still, it would indeed be better to keep only one of them. @ArthurZucker, why do we have both past_key_value and past_key_values?
