
Fix fp8 UT bugs #42442

Closed

YangKai0616 wants to merge 4 commits into huggingface:main from YangKai0616:main

Conversation

@YangKai0616
Contributor

What does this PR do?

When running the tests with the command pytest -rA tests/quantization/finegrained_fp8/test_fp8.py, the following failures are currently reported:

FAILED tests/quantization/finegrained_fp8/test_fp8.py::FP8QuantizerTest::test_quantized_model - AssertionError: 'Once upon a time, there was a man who was a farmer.' != 'Once upon a time, there was a man who was very rich.'
FAILED tests/quantization/finegrained_fp8/test_fp8.py::FP8QuantizerTest::test_quantized_model_conversion - AssertionError: 122 != 146
FAILED tests/quantization/finegrained_fp8/test_fp8.py::FP8QuantizerTest::test_quantized_model_multi_accelerator - AssertionError: 'Once upon a time, there was a man who was a farmer.' != 'Once upon a time, there was a man who was very rich.'
FAILED tests/quantization/finegrained_fp8/test_fp8.py::FP8QuantizerTest::test_save_pretrained - AssertionError: 'Once upon a time, there was a man who was a farmer.' != 'Once upon a time, there was a man who was very rich.'
FAILED tests/quantization/finegrained_fp8/test_fp8.py::FP8QuantizerTest::test_save_pretrained_offload - AssertionError: 'Once upon a time, there was a man who was a farmer.' != 'Once upon a time, there was a man who was very rich.'

The errors can be categorized into two types:

  1. Model output does not match EXPECTED_OUTPUT

Root cause: the _dtype used here was the meta model's parameter dtype, while the EXPECTED_OUTPUT appears to have been obtained with a fixed dtype of torch.float32. After I explicitly set _dtype to torch.float32, the result matched the expected output (see the first sketch after this list).

This PR updates the EXPECTED_OUTPUT accordingly. Please let me know your insights, thx!

  2. nb_linears - 25 ≠ nb_fp8_linear

Root cause: the replacement operation here can produce patterns like XX.fc1.XX → XX.fc*.XX, in which case the layer name fc1 listed in modules_to_not_convert fails to match the actual layer (see the second sketch after this list).

This PR fixes the string matching logic.
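
For reference, here are two small self-contained sketches of the issues above. Both are illustrations only: the checkpoint name, module path, and skip list are placeholders for demonstration, not necessarily the values used by the test.

The first sketch shows why the generated continuation, and therefore EXPECTED_OUTPUT, depends on the dtype the weights are loaded in:

```python
# Illustrative only: a placeholder checkpoint; torch_dtype pins the load dtype instead of
# inheriting it from the meta model's parameters.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "..."  # checkpoint used by FP8QuantizerTest, left unspecified here
tokenizer = AutoTokenizer.from_pretrained(model_id)
prompt = tokenizer("Once upon a time, there was a man who was", return_tensors="pt")

for dtype in (torch.float32, torch.bfloat16):
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=dtype)
    out = model.generate(**prompt, max_new_tokens=8, do_sample=False)
    # Even greedy decoding can yield different continuations under different dtypes,
    # because the logits differ slightly.
    print(dtype, tokenizer.decode(out[0], skip_special_tokens=True))
```

The second sketch reproduces the matching failure caused by rewriting digits before the modules_to_not_convert check, and shows that checking against the dotted name's components avoids it:

```python
import re

modules_to_not_convert = ["lm_head", "fc1"]      # skip list; illustrative values
current_key_name = "model.decoder.layers.0.fc1"  # module path; illustrative value

# Substring check after digit rewriting: "fc1" becomes "fc*", so the skip list no longer matches.
current_key_name_str = re.sub(r"\d+", "*", current_key_name)
print(current_key_name_str)                                                   # model.decoder.layers.*.fc*
print(any(key in current_key_name_str for key in modules_to_not_convert))    # False -> layer would be converted

# Part-based check (the direction of the fix): compare against the dotted-name components.
current_key_name_parts = current_key_name.split(".")
print(any(key in current_key_name_parts for key in modules_to_not_convert))  # True -> layer is skipped
```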

@YangKai0616
Contributor Author

@SunMarc , @ydshieh , please help review, thanks!

Member

@SunMarc left a comment


Thanks! Left a comment


```diff
 current_key_name_str = re.sub(r"\d+", "*", current_key_name)
-if not any(key in current_key_name_str for key in (modules_to_not_convert or [])):
+if not any(key in current_key_name_parts for key in (modules_to_not_convert or [])):
```
Member


Maybe we can use the same logic as mxfp4? We have the should_convert_module function for this. Indeed, the logic here changed recently and needs to be fixed.
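
As a rough point of reference, a minimal sketch of the kind of check such a helper performs could look like the following. The signature and body are assumptions for illustration, not the actual should_convert_module implementation in transformers:

```python
import re

def should_convert_module(current_key_name, patterns):
    """Return True if the module addressed by current_key_name should be converted.

    Sketch only: it matches the skip patterns against the dotted module name without
    rewriting digits, so components like "fc1" stay intact.
    """
    if not patterns:
        return True
    name = ".".join(current_key_name)
    return not any(re.search(rf"(^|\.){re.escape(pattern)}(\.|$)", name) for pattern in patterns)

# Illustrative usage:
print(should_convert_module(["model", "decoder", "layers", "0", "fc1"], ["fc1", "lm_head"]))       # False
print(should_convert_module(["model", "decoder", "layers", "0", "self_attn", "q_proj"], ["fc1"]))  # True
```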

Contributor Author


Done.

@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: finegrained_fp8

@SunMarc
Member

SunMarc commented Nov 27, 2025

These are a bit more complex to fix due to the refactor, so I decided to open a PR to fix them. I've added you as a co-author. Thanks for running those tests ;)

@YangKai0616
Contributor Author

Alright, I will follow the changes in the new PR and close this PR.

