Fix mamba caches by manueldeprada · Pull Request #40196 · huggingface/transformers

manueldeprada · 2025-08-15T11:05:42Z

This PR takes as base v4.55.2 and adds the fix for mamba-style caches from #39797

Tested with vllm-main.

… option (huggingface#39953) * Fix MXFP4 quantizer validation to enable CPU dequantization Move dequantize check before CUDA availability check to allow CPU inference when quantization_config.dequantize is True. This enables users to run MXFP4 models on CPU by automatically converting them to BF16 format. * Add tests for MXFP4 quantizer CPU dequantization validation * fix: format mxfp4 test file with ruff

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

* remove dep * style * rm import * fix * style * simplify * style

fix

* Fix missing video inputs for PerceptionLM. * Minor fix for vanilla input image (only C,H,W, no tiles dim). * Revert "Minor fix for vanilla input image (only C,H,W, no tiles dim)." This reverts commit 181d87b.

* fix fuyu Signed-off-by: Isotr0py <2037008807@qq.com> * oops Signed-off-by: Isotr0py <2037008807@qq.com> * run test on GPU Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> * clean unused Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> * revert Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn> * add fuyu multimodal test Signed-off-by: Isotr0py <2037008807@qq.com> * fix Signed-off-by: Isotr0py <2037008807@qq.com> --------- Signed-off-by: Isotr0py <2037008807@qq.com> Signed-off-by: Isotr0py <mozf@mail2.sysu.edu.cn>

…mask (huggingface#39991) (huggingface#40024) * Fix missing None default values for Gemma3n model in get_placeholder_mask (huggingface#39991) * Switched definition of optional from| None to Optiona[] (Issue huggingface#39991) --------- Co-authored-by: Laurenz Ruzicka <Laurenz.Ruzicka@ait.ac.at>

…ce#39986) * fix: resolve triton version check compatibility on windows * style: remove trailing space * fix: fix typo --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

* fix * update integration tests * fmt * add regression test

* default to dq if cpu * an other check * style * revert some changes

…gface#39975) * [bugfix] ensure correct tensor device in Idefics2, Idefics3, and SmolVLM models * to cuda

LysandreJik and others added 17 commits August 5, 2025 18:09

Release: v4.55.0

06f8004

[CI] post-GptOss fixes for green CI (huggingface#39929)

daab2db

Enable gpt-oss mxfp4 on older hardware (sm75+) (huggingface#39940)

cc98f42

Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com>

remove triton_kernels dep with kernels instead (huggingface#39926)

382717e

* remove dep * style * rm import * fix * style * simplify * style

[Idefics] fix device mismatch (huggingface#39981)

1d42803

fix

Fix missing video inputs for PerceptionLM. (huggingface#39971)

0d9032a

* Fix missing video inputs for PerceptionLM. * Minor fix for vanilla input image (only C,H,W, no tiles dim). * Revert "Minor fix for vanilla input image (only C,H,W, no tiles dim)." This reverts commit 181d87b.

fix: resolve triton version check compatibility on windows (huggingfa…

b8e97fb

…ce#39986) * fix: resolve triton version check compatibility on windows * style: remove trailing space * fix: fix typo --------- Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>

[GPT Big Code] Fix attention scaling (huggingface#40041)

0d69080

* fix * update integration tests * fmt * add regression test

Default to dequantize if cpu in device_map for mxfp4 (huggingface#39993)

99404c7

* default to dq if cpu * an other check * style * revert some changes

fix merge conlicts

79a9ffc

[bugfix] Fix tensor device in Idefics2, Idefics3, and SmolVLM (huggin…

956be23

…gface#39975) * [bugfix] ensure correct tensor device in Idefics2, Idefics3, and SmolVLM models * to cuda

v4.55.1

ea2eee0

qfix bad cherry-pick

aaa3169

v4.55.2

acf295a

manueldeprada added the for patch Tag issues / labels that should be included in the next patch label Aug 15, 2025

fix mamba models caches inheritance

56a7903

manueldeprada force-pushed the fix-mamba-caches branch from df09a44 to 56a7903 Compare August 15, 2025 11:11

manueldeprada closed this Aug 15, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix mamba caches#40196

Fix mamba caches#40196
manueldeprada wants to merge 18 commits intohuggingface:mainfrom
manueldeprada:fix-mamba-caches

manueldeprada commented Aug 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants

Conversation

manueldeprada commented Aug 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

15 participants