Raise explicit error when FP8 is requested via from_config by dikshyantacharya · Pull Request #42865 · huggingface/transformers

dikshyantacharya · 2025-12-14T21:59:08Z

Summary

from_config() currently ignores FP8 intent specified via quantization_config, silently falling back to FP32.
This PR makes the behavior explicit by raising a clear error instead.

What this PR does

Raises NotImplementedError when FP8 is requested via from_config()
Adds a regression test to lock in the behavior

Why

Prevents silent misconfiguration
Improves API correctness
Establishes a safe foundation for future FP8 support

Scope

This PR does not enable FP8 kernels or backend support.
It focuses solely on correctness and explicit behavior.

MekkCyber

Thanks for the PR!

MekkCyber · 2025-12-15T09:12:19Z

+        if quant_config is not None:
+            if quant_config.get("quant_method") == "fp8":
+                raise NotImplementedError(
+                    "FP8 via `from_config()` is not yet supported. "
+                    "FP8 models must be created via `from_pretrained()` with an FP8-capable backend."
+                )


let's make it general and not only for fp8

MekkCyber · 2025-12-15T09:13:07Z

+import pytest
+
+from transformers import AutoConfig, AutoModel
+
+
+def test_fp8_from_config_raises():
+    config = AutoConfig.from_pretrained("gpt2")
+    config.quantization_config = {"quant_method": "fp8"}
+
+    with pytest.raises(NotImplementedError, match="FP8 via"):
+        AutoModel.from_config(config)


could you add the test to the quantization tests instead ?

github-actions · 2025-12-15T18:17:49Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: config

github-actions · 2025-12-15T18:35:17Z

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=42865&sha=417544

dikshyantacharya force-pushed the fp8-from-config branch 2 times, most recently from 0b2d42a to d86e813 Compare December 14, 2025 22:34

dikshyantacharya mentioned this pull request Dec 14, 2025

[Quantization FP8] Native from_config support #42804

Open

MekkCyber reviewed Dec 15, 2025

View reviewed changes

dikshyantacharya force-pushed the fp8-from-config branch 6 times, most recently from fd7022b to 8d39746 Compare December 15, 2025 18:16

Raise error when quantization_config is passed to from_config

fcd2e2d

dikshyantacharya force-pushed the fp8-from-config branch from 68a6441 to fcd2e2d Compare December 15, 2025 18:25

Merge branch 'main' into fp8-from-config

4175448

This was referenced Apr 29, 2026

Cumulative feature and defect updates from recent Transformers PRs evalstate/transformers#42

Open

Cumulative defect fixes from recent Transformers PRs evalstate/transformers#43

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Raise explicit error when FP8 is requested via from_config#42865

Raise explicit error when FP8 is requested via from_config#42865
dikshyantacharya wants to merge 2 commits intohuggingface:mainfrom
dikshyantacharya:fp8-from-config

dikshyantacharya commented Dec 14, 2025

Uh oh!

MekkCyber left a comment

Uh oh!

MekkCyber Dec 15, 2025

Uh oh!

MekkCyber Dec 15, 2025

Uh oh!

github-actions Bot commented Dec 15, 2025

Uh oh!

github-actions Bot commented Dec 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dikshyantacharya commented Dec 14, 2025

Summary

What this PR does

Why

Scope

Uh oh!

MekkCyber left a comment

Choose a reason for hiding this comment

Uh oh!

MekkCyber Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

MekkCyber Dec 15, 2025

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Dec 15, 2025

Uh oh!

github-actions Bot commented Dec 15, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants