Fix HybridChunedCache & Llama4 by Cyrilvallez · Pull Request #38299 · huggingface/transformers

Cyrilvallez · 2025-05-22T15:12:04Z

What does this PR do?

As per the title!
It aligns HybridChunkedCache on what was done in #37972 (i.e. simply take the min between max_len and window), which was causing lengths mismatches with llama4 in get_mask_sizes with the default HybridChunkedCache!

ArthurZucker

Niiiice!

HuggingFaceDocBuilderDev · 2025-05-22T15:30:02Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Cyrilvallez added 2 commits May 22, 2025 17:11

Update cache_utils.py

9c06457

Update cache_utils.py

29a360f

ArthurZucker reviewed May 22, 2025

View reviewed changes

ArthurZucker approved these changes May 22, 2025

View reviewed changes

Cyrilvallez merged commit 73286d8 into main May 22, 2025
21 checks passed

Cyrilvallez deleted the fix-hybrid-chunked-cache branch May 22, 2025 15:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix HybridChunedCache & Llama4#38299

Fix HybridChunedCache & Llama4#38299
Cyrilvallez merged 2 commits intomainfrom
fix-hybrid-chunked-cache

Cyrilvallez commented May 22, 2025 •

edited

Loading

Uh oh!

ArthurZucker left a comment

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented May 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Cyrilvallez commented May 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What does this PR do?

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

HuggingFaceDocBuilderDev commented May 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Cyrilvallez commented May 22, 2025 •

edited

Loading