Registers StaticCache serialization functions for torch.export.export by xadupre · Pull Request #39931 · huggingface/transformers

xadupre · 2025-08-05T18:04:51Z

What does this PR do?

Registers serialization functions for StaticCache. After this, torch.export.export does not need extra work to export a model using a StaticCache.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline,
Pull Request section?
Was this discussed/approved via a Github issue or the forum? Please add a link
to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the
documentation guidelines, and
here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

xadupre · 2025-08-05T18:08:15Z

cc @yuanyao-nv
cc @tugsbayasgalan, this PR should simplify the export for Executorch + StaticCache

yuanyao-nv · 2025-08-05T19:59:03Z

@xadupre Is the intended usage to go through executorch after this change? We previously found that the executorch wrapper registers past k/v as buffers, and as a result the exported ONNX has them as initializers.

xadupre · 2025-08-05T22:07:39Z

It is not. I wanted to simplify the wrapper around executorch because I need their code to patch vmap and write a full test with dynamic shapes.

tugsbayasgalan · 2025-08-06T01:30:27Z

@gante Is cache_utils.py the right place to put all export + caching related logic? I also want to add similar support for MambaCache but executorch doesn't feel like the right place to put it since it will cover other export users like ONNX. Is there dedicated directory for maybe PT-2/compile related changes?

…to series

xadupre · 2025-08-06T13:16:51Z

@gante Is cache_utils.py the right place to put all export + caching related logic? I also want to add similar support for MambaCache but executorch doesn't feel like the right place to put it since it will cover other export users like ONNX. Is there dedicated directory for maybe PT-2/compile related changes?

After the serialization functions are registered for all the caches, classes such as TorchExportableModuleWithHybridCache should not be needed anymore.

…to series

Rocketknight1 · 2025-08-06T15:38:58Z

cc @gante for staticcache too!

tugsbayasgalan · 2025-08-06T17:29:31Z

@gante Is cache_utils.py the right place to put all export + caching related logic? I also want to add similar support for MambaCache but executorch doesn't feel like the right place to put it since it will cover other export users like ONNX. Is there dedicated directory for maybe PT-2/compile related changes?

After the serialization functions are registered for all the caches, classes such as TorchExportableModuleWithHybridCache should not be needed anymore.

@xadupre Just want to confirm! For individual cache classes like MambaCache, we still need to register them to pytree seperately right?

xadupre · 2025-08-06T17:42:40Z

Yes, the cache type is the key for the registration. If you have something like class MyDynamicCache(DynamicCache): pass, you need to registrer MyDynamicCache.

ArthurZucker

Sorry about the late review, was waiting because #39797 was just merged, which has lazy inits again (it added a way not to use it and touched the executorch integration!)

ArthurZucker · 2025-08-11T11:25:05Z

Can you make sure your rebase! 🤗

Cyrilvallez · 2025-08-15T08:30:28Z

Hey! Just a heads-up about #40075! Pytree registration conflicts with fsdp apparently - everything should be part of executorch integration only for this reason (and the fact that since it's specific to export, it should reside there as well 🤗)

justinchuby · 2025-08-21T16:41:45Z

@Cyrilvallez @ArthurZucker the fix is not specific to executorch, but is also needed for torch.export compatibility that is used by ONNX export and {xla etc.}. Is there a path to natively support this in transformers?

tugsbayasgalan · 2025-08-21T17:31:18Z

I think we need more general directory where we can put torch.export specific things. Executorch is one user of torch.export, so if we move the pytree registration into executorch directory, other export users can't use them easily.

Cyrilvallez · 2025-08-22T09:05:44Z

The fact is that export necessarily need to register correct mask function (the one without vmap) to work correctly. Otherwise export will fail. So IMO, for now at least it makes sense to have everything there

xadupre added 2 commits August 5, 2025 19:59

Registers StaticCache serialization functions for torch.export.export

77e824c

style

82339ff

xadupre mentioned this pull request Aug 5, 2025

Add serialization function for StaticCache #38879

Closed

5 tasks

xadupre added 2 commits August 6, 2025 14:45

Merge branch 'main' of https://github.com/huggingface/transformers in…

1ac350f

…to series

remove unit test on StaticCache with executorch

68e0b4b

xadupre added 4 commits August 6, 2025 15:26

Merge branch 'main' of https://github.com/huggingface/transformers in…

bdbcde4

…to series

disable one test for torch==2.7

6cb8b9e

Merge branch 'main' of https://github.com/huggingface/transformers in…

e95aaca

…to series

add EncoderDecoderCache

cacd7ba

xadupre mentioned this pull request Aug 7, 2025

Dynamo exporter patches huggingface/optimum-onnx#27

Closed

ArthurZucker reviewed Aug 11, 2025

View reviewed changes

evalstate mentioned this pull request Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Conversation

xadupre commented Aug 5, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

xadupre commented Aug 5, 2025

Uh oh!

yuanyao-nv commented Aug 5, 2025

Uh oh!

xadupre commented Aug 5, 2025

Uh oh!

tugsbayasgalan commented Aug 6, 2025

Uh oh!

xadupre commented Aug 6, 2025

Uh oh!

Rocketknight1 commented Aug 6, 2025

Uh oh!

tugsbayasgalan commented Aug 6, 2025

Uh oh!

xadupre commented Aug 6, 2025

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

ArthurZucker commented Aug 11, 2025

Uh oh!

Cyrilvallez commented Aug 15, 2025

Uh oh!

justinchuby commented Aug 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tugsbayasgalan commented Aug 21, 2025

Uh oh!

Cyrilvallez commented Aug 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

justinchuby commented Aug 21, 2025 •

edited

Loading