Bump transformers to 4.56.1 by jackzhxng · Pull Request #136 · huggingface/optimum-executorch

jackzhxng · 2025-09-04T21:02:30Z

Some changes needed for the bump:

- Use early initialization to populate cache layer info (transformers#39797, "[core] Refactor the Cache logic to make it simpler and more general").
- The DecoderOnlyLM constructor no longer takes max seq length and batch size args.
- Hybrid cache is being deprecated; static cache now serves the same purpose with per-layer configs (transformers#40276, "One cache class to rule them all").
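
To make the "per-layer configs" point concrete, here is a minimal sketch in plain Python. It is not the transformers implementation: `LayerCacheSpec`, `build_layer_specs`, and the layer-type labels are hypothetical names chosen for illustration. The assumption is that a sliding-window layer never needs more cache slots than its window, while a full-attention layer needs the full cache length.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class LayerCacheSpec:
    """Hypothetical per-layer cache description (not a transformers class)."""

    layer_type: str  # assumed labels: "full_attention" or "sliding_attention"
    cache_len: int   # number of key/value positions this layer keeps


def build_layer_specs(
    layer_types: List[str],
    max_cache_len: int,
    sliding_window: Optional[int] = None,
) -> List[LayerCacheSpec]:
    """Give each layer its own cache length instead of one global setting."""
    specs = []
    for layer_type in layer_types:
        if layer_type == "sliding_attention":
            if sliding_window is None:
                raise ValueError("sliding_attention layers need a sliding_window")
            # A sliding layer never needs more slots than its window.
            cache_len = min(sliding_window, max_cache_len)
        else:
            cache_len = max_cache_len
        specs.append(LayerCacheSpec(layer_type, cache_len))
    return specs


specs = build_layer_specs(
    ["sliding_attention", "sliding_attention", "full_attention"],
    max_cache_len=1024,
    sliding_window=256,
)
for spec in specs:
    print(spec)
```

With one class that accepts a mix of layer kinds like this, a separate hybrid cache type is not needed; how transformers actually wires this up is described in the linked PR.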

HuggingFaceDocBuilderDev · 2025-09-04T21:06:12Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

jackzhxng · 2025-09-04T21:42:04Z

Looks like we need to fix some stuff 😂

kimishpatel · 2025-09-15T21:53:04Z

+        )
+        num_heads = getattr(config, "num_key_value_heads", config.num_attention_heads)
+        head_dim = getattr(config, "head_dim", config.hidden_size // config.num_attention_heads)
+        self.early_initialization(


What is this doing?

(In PR description)

The linked PR description says that cache_position will also be removed. Is that about the cache maintaining cache_position itself? Something to keep track of.

Oh, it's the part on early_initialization. Basically, you need to call this to initialize some of the attributes you need on each cache layer.
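
As a rough illustration of the exchange above, the sketch below mirrors the two `getattr` fallbacks from the diff and feeds the result into a toy cache. `ToyCache` and `ToyCacheLayer` are hypothetical stand-ins written for this example, not transformers classes, and the config values are made up. The only point is that each layer's shape is unknown until something initializes it, so an up-front call fills in every layer before first use.

```python
from types import SimpleNamespace


def kv_cache_dims(config):
    """Mirror the two getattr fallbacks from the diff above."""
    # Grouped-query configs expose num_key_value_heads; otherwise fall back
    # to the number of attention heads.
    num_heads = getattr(config, "num_key_value_heads", config.num_attention_heads)
    # Use an explicit head_dim if the config has one; otherwise derive it.
    head_dim = getattr(config, "head_dim", config.hidden_size // config.num_attention_heads)
    return num_heads, head_dim


class ToyCacheLayer:
    """Hypothetical cache layer whose shape is unknown until initialized."""

    def __init__(self, max_cache_len):
        self.max_cache_len = max_cache_len
        self.shape = None

    def initialize(self, batch_size, num_heads, head_dim):
        self.shape = (batch_size, num_heads, self.max_cache_len, head_dim)


class ToyCache:
    """Hypothetical cache holding one ToyCacheLayer per model layer."""

    def __init__(self, num_layers, max_cache_len):
        self.layers = [ToyCacheLayer(max_cache_len) for _ in range(num_layers)]

    def early_initialization(self, batch_size, num_heads, head_dim):
        # Initialize every layer up front instead of on first use.
        for layer in self.layers:
            layer.initialize(batch_size, num_heads, head_dim)


# Made-up configs: one with grouped-query attention, one without.
gqa_config = SimpleNamespace(num_attention_heads=32, num_key_value_heads=8, hidden_size=4096)
mha_config = SimpleNamespace(num_attention_heads=12, hidden_size=768)

cache = ToyCache(num_layers=2, max_cache_len=128)
num_heads, head_dim = kv_cache_dims(gqa_config)
cache.early_initialization(batch_size=1, num_heads=num_heads, head_dim=head_dim)
print([layer.shape for layer in cache.layers])
```

One Python detail worth keeping in mind with this pattern: `getattr` only falls back when the attribute is missing, so an attribute that exists but is set to `None` is returned as `None`.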

kimishpatel · 2025-09-15T21:57:47Z

+                args=(),
+                kwargs={
+                    "input_ids": input_ids,
+                    "cache_position": cache_position,
+                },


Why this change?

https://github.com/huggingface/transformers/pull/39836/files#diff-6a4faa71e9436fb9187908174ea7ae18774e4ad3ed0cda17a29edec657383a41L249-R254

This changed from args to kwargs in transformers.
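
For readers unfamiliar with the `args=()` plus `kwargs={...}` calling shape in the diff, here is a small plain-Python sketch of how the two forms bind to a function's parameters. `forward_v1`, `forward_v2`, and `call_like_export` are hypothetical helpers written for this example; they do not reproduce the transformers or export code, and the thread itself gives no reason for the change beyond matching upstream.

```python
import inspect


def forward_v1(input_ids, cache_position):
    """Hypothetical forward signature."""
    return {"input_ids": input_ids, "cache_position": cache_position}


def forward_v2(cache_position, input_ids):
    """Same parameters as forward_v1, declared in a different order."""
    return {"input_ids": input_ids, "cache_position": cache_position}


def call_like_export(fn, args=(), kwargs=None):
    """Bind args/kwargs against fn's signature, then call it."""
    bound = inspect.signature(fn).bind(*args, **(kwargs or {}))
    return fn(*bound.args, **bound.kwargs)


input_ids = [[101]]
cache_position = [0]
named = {"input_ids": input_ids, "cache_position": cache_position}

# Positional and keyword forms reach the same parameters on forward_v1.
positional = call_like_export(forward_v1, args=(input_ids, cache_position))
keyword = call_like_export(forward_v1, args=(), kwargs=named)

# Keyword passing still lines up when the parameter order differs...
reordered = call_like_export(forward_v2, args=(), kwargs=named)
# ...whereas positional passing silently swaps the two inputs.
swapped = call_like_export(forward_v2, args=(input_ids, cache_position))

print(positional == keyword == reordered)
print(swapped["input_ids"] is cache_position)
```

Passing inputs by name ties each value to a parameter regardless of declaration order, which is a general property of keyword arguments rather than a statement about why transformers made the switch.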

kimishpatel · 2025-09-15T21:59:20Z

OK, this looks largely benign, as in things that have to be done to make it work. But I would suggest putting comments around the changes to make them easier to review; I don't know why all of the changes are needed.

Bump transformers to 4.56.1

93f4e34

jackzhxng requested review from kimishpatel and mergennachin September 4, 2025 21:02

kimishpatel approved these changes Sep 4, 2025


jackzhxng added 5 commits September 5, 2025 10:01

Try to fix tests

6d8c056

Merge branch 'huggingface:main' into jz/bump-transformers-2

b1d72b1

Fix tests

1e1e348

Fix multimodal tests

0adf61d

Hybrid cache is getting deprecated

3bcc8ef

kimishpatel reviewed Sep 15, 2025


jackzhxng merged commit 828ae02 into huggingface:main Sep 16, 2025
65 of 81 checks passed

jackzhxng merged 6 commits into huggingface:main from jackzhxng:jz/bump-transformers-2
