Fix: propagate quantization_config to text sub-config for composite models in AutoModelForCausalLM#45494

Merged
SunMarc merged 2 commits into huggingface:main from
lvliang-intel:fix/propagate-quantization-config-to-text-subconfig
Apr 21, 2026
Conversation

@lvliang-intel
Contributor

What does this PR do?

Fixes loading of quantized composite models (e.g. Qwen3.5-35B-A3B with AutoRound quantization) via AutoModelForCausalLM.from_pretrained.

Problem:
For composite models whose model_class.config_class maps to text_config, the from_pretrained method in auto_factory.py swaps the full composite config with the text sub-config:

if model_class.config_class == config.sub_configs.get("text_config", None):
    config = config.get_text_config()

The quantization_config is stored on the top-level composite config (from config.json), but not on the text sub-config. When modeling_utils.py later calls get_hf_quantizer, it checks hasattr(config, "quantization_config") to determine pre_quantized. Since the text sub-config lacks this attribute, pre_quantized is set to False, which causes a ValueError for quantization methods that require pre-quantized weights (e.g. AutoRound, GPTQ, AWQ).

ValueError: The quantization method QuantizationMethod.AUTOROUND does require the model to be pre-quantized.
You explicitly passed pre_quantized=False meaning your model weights are not quantized.
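The failure mode above can be shown with a minimal sketch. The config classes below are hypothetical stand-ins, not the transformers source; they only reproduce the shape of the check described: the quantization settings live on the composite config, so a hasattr check against the swapped-in text sub-config comes back False.

```python
class TextSubConfig:
    """Stands in for the text sub-config; it carries no quantization_config."""
    pass


class CompositeConfig:
    """Stands in for the top-level composite config loaded from config.json."""
    def __init__(self):
        self.quantization_config = {"quant_method": "auto-round", "bits": 4}
        self.text_config = TextSubConfig()


# After the swap, downstream code only sees the text sub-config
config = CompositeConfig().text_config

# The pre_quantized determination described above
pre_quantized = hasattr(config, "quantization_config")
print(pre_quantized)  # False -> ValueError for AutoRound/GPTQ/AWQ
```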

How to reproduce the issue:

import transformers

qconfig = transformers.AutoRoundConfig()
model = transformers.AutoModelForCausalLM.from_pretrained(
    "Intel/Qwen3.5-35B-A3B-int4-AutoRound",
    trust_remote_code=True,
    device_map="cuda",
    quantization_config=qconfig,
)

Fixes # (issue)

Fix:
After swapping to the text sub-config, propagate quantization_config from the parent composite config when the parent has one and the text sub-config does not already have one.
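The propagation step can be sketched as follows. This is a hedged illustration of the described behavior, not the PR's exact diff; the config classes are minimal stand-ins for the real transformers config objects.

```python
class TextSubConfig:
    """Stand-in for the text sub-config (no quantization_config of its own)."""
    pass


class CompositeConfig:
    """Stand-in for the top-level composite config from config.json."""
    def __init__(self):
        self.quantization_config = {"quant_method": "auto-round", "bits": 4}
        self.text_config = TextSubConfig()

    def get_text_config(self):
        return self.text_config


config = CompositeConfig()
text_config = config.get_text_config()

# Propagate quantization_config from the parent composite config if the
# text sub-config does not already carry one.
if hasattr(config, "quantization_config") and not hasattr(text_config, "quantization_config"):
    text_config.quantization_config = config.quantization_config

config = text_config
print(config.quantization_config["quant_method"])  # auto-round
```

With the attribute present on the swapped-in config, the hasattr check in get_hf_quantizer sets pre_quantized correctly and the ValueError no longer fires.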

@Rocketknight1
Member

cc @SunMarc for quants

Member

@SunMarc SunMarc left a comment

Thanks a lot! Left a suggestion and a comment.

Comment thread src/transformers/models/auto/auto_factory.py
…odels in AutoModelForCausalLM

Signed-off-by: lvliang-intel <liang1.lv@intel.com>
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
@lvliang-intel lvliang-intel force-pushed the fix/propagate-quantization-config-to-text-subconfig branch from aaf3b31 to c9981a2 Compare April 21, 2026 02:04
@github-actions
Contributor

[For maintainers] Suggested jobs to run (before merge)

run-slow: auto

@SunMarc SunMarc enabled auto-merge April 21, 2026 12:21
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Member

@SunMarc SunMarc left a comment


Nice !

@SunMarc SunMarc added this pull request to the merge queue Apr 21, 2026
Merged via the queue into huggingface:main with commit 85099df Apr 21, 2026
28 checks passed

4 participants