
Fix: Skip weight initialization for quantized int8 models #39491

Open

imjbassi wants to merge 1 commit into huggingface:main from imjbassi:skip-quantized-weight-init

Conversation

@imjbassi

Fixes #39366 by skipping weight initialization when all model parameters have non-floating-point dtypes (e.g., int8 from W8A8-quantized models). This avoids a RuntimeError from PyTorch's normal_() function, which does not support integer dtypes.

Adds a conditional check inside _load_pretrained_model that skips self.initialize_weights() when every parameter is integer-typed; see the sketch below.
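
A minimal sketch of what the gate might look like. The helper name is hypothetical; the PR only states that the check lives in _load_pretrained_model and guards self.initialize_weights(), so the actual code may differ:

```python
from torch import nn


def _all_params_are_integer(model: nn.Module) -> bool:
    """Hypothetical helper: True when the loaded checkpoint contains no
    floating-point parameters at all, e.g. an int8 W8A8-quantized model.
    Re-drawing such weights with normal_() would raise, since normal_()
    only supports floating-point and complex dtypes."""
    params = list(model.parameters())
    return len(params) > 0 and all(not p.is_floating_point() for p in params)


# Illustrative placement inside _load_pretrained_model:
# if not _all_params_are_integer(model):
#     model.initialize_weights()
```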

Tested to ensure the model loads without crashing for quantized checkpoints.
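
For context, the underlying failure is reproducible with PyTorch alone, since normal_() rejects integer tensors:

```python
import torch

t = torch.empty(4, dtype=torch.int8)
t.normal_()  # raises RuntimeError (e.g. "normal_kernel_cpu" not implemented for 'Char')
```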

@Rocketknight1
Member

cc @SunMarc @MekkCyber



Development

Successfully merging this pull request may close these issues.

RuntimeError when loading llmcompressor W8A8 quantized model: int8 dtype in weight initialization
