Fix incompatible weight names by mengniwang95 · Pull Request #1759 · intel/auto-round

mengniwang95 · 2026-04-29T03:32:52Z

Description

transformers will do checkpoint name conversion automatically and then cause the mismatch between quantized weight name and original name

W4A16 Qwen2.5-VL-7B-Instruct generated by this PR can be loaded by sglang and transformers

Example for this issue (qwen2_5_vl):

The original weight names are:
"model.layers.0.mlp.down_proj.weight"
"visual.blocks.0.attn.proj.weight"
...

After AR quantiztaion, the saved weight names are:
"model.language_model.layers.0.mlp.down_proj.weight" (incorrect weight)
"model.visual.blocks.0.attn.proj.weight" (incorrect weight name)
"model.layers.0.mlp.down_proj.qweight"
...

the block_name_to_quantize is:
"block_name_to_quantize": [
"model.language_model.layers",
"model.layers"
],
Since the change of weight name, sglang can not load the quantized weight.

Using this PR, the AR generated weight names will be:
"model.layers.0.mlp.down_proj.qweight"
"visual.blocks.0.attn.proj.weight"
...
and the block_name_to_quantize is:
"block_name_to_quantize": "model.layers"

Type of Change

Bug fix

Related Issues

#982

Checklist Before Submitting

My code has been tested locally.
Documentation has been updated as needed.
New or updated tests are included where applicable.
The CUDA CI has passed. You can trigger it by commenting /azp run Unit-Test-CUDA-AutoRound.

Co-authored-by: Copilot <copilot@github.com>

for more information, see https://pre-commit.ci

Copilot

Pull request overview

Fixes weight-name mismatches caused by Transformers checkpoint key conversion (via _checkpoint_conversion_mapping), so exported quantized checkpoints/configs retain the expected “original” key names and can be loaded by downstream runtimes (e.g., sglang).

Changes:

Added utility helpers to apply/revert checkpoint conversion mappings for parameter/block names.
Applied checkpoint conversion mapping when deriving quant_block_list during inference-time model conversion.
Reverted checkpoint conversion mapping when saving shard tensors and when serializing to_quant_block_names into exported configs.

Reviewed changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
auto_round/utils/common.py	Adds helpers to apply/revert checkpoint conversion regex mappings.
auto_round/inference/convert_model.py	Applies checkpoint conversion mapping when building quantization block prefixes.
auto_round/compressors_new/shard_writer.py	Reverts mapped names when writing shard tensor keys.
auto_round/compressors_new/base.py	Reverts mapped `to_quant_block_names` in exported serialization metadata.
auto_round/compressors/shard_writer.py	Reverts mapped names when writing shard tensor keys (old arch).
auto_round/compressors/base.py	Reverts mapped `to_quant_block_names` in exported serialization metadata (old arch).
auto_round/autoround.py	Attempts to merge `extra_config` values into constructor args (currently introduces a crash when `extra_config=None`).

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

mengniwang95 · 2026-04-29T06:41:31Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-29T06:41:40Z

Azure Pipelines successfully started running 1 pipeline(s).

for more information, see https://pre-commit.ci

mengniwang95 · 2026-04-30T03:34:42Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-30T03:34:51Z

Azure Pipelines successfully started running 1 pipeline(s).

mengniwang95 · 2026-04-30T05:14:46Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-30T05:14:55Z

Azure Pipelines successfully started running 1 pipeline(s).

yiliu30

lgtm, It’d be great to have a few UTs to validate the model config.

Co-authored-by: Copilot <copilot@github.com>

for more information, see https://pre-commit.ci

Co-authored-by: Copilot <copilot@github.com>

mengniwang95 · 2026-04-30T14:30:00Z

/azp run Unit-Test-CUDA-AutoRound

azure-pipelines · 2026-04-30T14:30:10Z

Azure Pipelines successfully started running 1 pipeline(s).

Fix incompatible weight names

de23c8a

Co-authored-by: Copilot <copilot@github.com>

Copilot AI review requested due to automatic review settings April 29, 2026 03:32

[pre-commit.ci] auto fixes from pre-commit.com hooks

c927d44

for more information, see https://pre-commit.ci

Copilot AI reviewed Apr 29, 2026

View reviewed changes

Comment thread auto_round/compressors_new/shard_writer.py

Comment thread auto_round/autoround.py Outdated

Comment thread auto_round/compressors/shard_writer.py

mengniwang95 and others added 2 commits April 29, 2026 13:54

Merge branch 'main' into mengni/fix_vlm

3b664ad

Apply suggestion from @Copilot

021133e

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

mengniwang95 requested a review from n1ck-guo April 29, 2026 06:32

mengniwang95 requested review from xin3he and yiliu30 April 29, 2026 07:50

mengniwang95 and others added 3 commits April 30, 2026 10:37

add ut

8389c52

Merge branch 'main' into mengni/fix_vlm

d5fe788

[pre-commit.ci] auto fixes from pre-commit.com hooks

0d00b12

for more information, see https://pre-commit.ci

fix ut

8b666af

yiliu30 approved these changes Apr 30, 2026

View reviewed changes

mengniwang95 and others added 7 commits April 30, 2026 16:49

adapt to latest transformers

0fb900c

Co-authored-by: Copilot <copilot@github.com>

Merge branch 'main' into mengni/fix_vlm

00bcd1c

[pre-commit.ci] auto fixes from pre-commit.com hooks

90ed33f

for more information, see https://pre-commit.ci

add config check in ut

49f5124

bug fix

26e828c

Co-authored-by: Copilot <copilot@github.com>

fix CI issue

6035268

Merge branch 'main' into mengni/fix_vlm

bdc3142

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix incompatible weight names#1759

Fix incompatible weight names#1759
mengniwang95 wants to merge 15 commits intomainfrom
mengni/fix_vlm

mengniwang95 commented Apr 29, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mengniwang95 commented Apr 29, 2026

Uh oh!

azure-pipelines Bot commented Apr 29, 2026

Uh oh!

mengniwang95 commented Apr 30, 2026

Uh oh!

azure-pipelines Bot commented Apr 30, 2026

Uh oh!

mengniwang95 commented Apr 30, 2026

Uh oh!

azure-pipelines Bot commented Apr 30, 2026

Uh oh!

yiliu30 left a comment •

edited

Loading

Uh oh!

mengniwang95 commented Apr 30, 2026

Uh oh!

azure-pipelines Bot commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

mengniwang95 commented Apr 29, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Type of Change

Related Issues

Checklist Before Submitting

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

mengniwang95 commented Apr 29, 2026

Uh oh!

azure-pipelines Bot commented Apr 29, 2026

Uh oh!

mengniwang95 commented Apr 30, 2026

Uh oh!

azure-pipelines Bot commented Apr 30, 2026

Uh oh!

mengniwang95 commented Apr 30, 2026

Uh oh!

azure-pipelines Bot commented Apr 30, 2026

Uh oh!

yiliu30 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

mengniwang95 commented Apr 30, 2026

Uh oh!

azure-pipelines Bot commented Apr 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

mengniwang95 commented Apr 29, 2026 •

edited

Loading

yiliu30 left a comment •

edited

Loading