
Fix save_pretrained writing incorrect tie_word_embeddings=True config after PEFT merge #45156

Closed
Cursx wants to merge 9 commits into huggingface:main from Cursx:fix-issue

Conversation

Cursx commented Apr 1, 2026

What does this PR do?

After PEFT's merge_and_unload(), embed_tokens and lm_head become independent tensors with different values, but config.tie_word_embeddings remains True. The load side already detects this with torch.equal in tie_weights() and skips re-tying, but save_pretrained() writes the stale config as-is. In other words, tie_word_embeddings=True is already semantically wrong in memory; flipping it to False matches the actual state of the weights.

Issue #45127: PEFT's merge_and_unload() creates an inconsistent state (the weights have been untied, but the configuration has not been updated).
Impact: downstream tools (GGUF converters, quantization scripts) trust this config directly, leading to silent weight corruption.
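
The idea of the fix, as a simplified sketch (the helper name and its placement here are illustrative, not the literal diff; the real save_pretrained() also handles sharding, safetensors, etc.):

```python
import torch


def _embeddings_have_diverged(model) -> bool:
    """True if the input and output embeddings no longer share the same values."""
    input_emb = model.get_input_embeddings()
    output_emb = model.get_output_embeddings()
    if input_emb is None or output_emb is None:
        return False
    in_w, out_w = input_emb.weight, output_emb.weight
    if in_w.data_ptr() == out_w.data_ptr():
        return False  # same storage, still genuinely tied
    return in_w.shape != out_w.shape or not torch.equal(in_w, out_w)


# Before serializing config.json, downgrade the flag if the weights have
# actually diverged, so downstream tools (GGUF converters, quantizers)
# see a truthful config.
def save_with_consistent_config(model, save_directory: str):
    if model.config.tie_word_embeddings and _embeddings_have_diverged(model):
        model.config.tie_word_embeddings = False
    model.save_pretrained(save_directory)
```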

Tests

I wrote a simple script to reproduce the problem and tested it locally.
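
Roughly, the reproduction looks like this (the adapter settings below are illustrative; the linked issue uses an extended vocabulary on Qwen2.5-0.5B, but any small model with tie_word_embeddings=True shows the same behavior):

```python
import torch
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Qwen2.5-0.5B is the model from the linked issue; any tied-embeddings model works.
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")
assert model.config.tie_word_embeddings

# Training full copies of embed_tokens/lm_head (modules_to_save) is the usual
# extended-vocabulary setup; it is what breaks the tie once merged.
peft_model = get_peft_model(
    model,
    LoraConfig(
        target_modules=["q_proj", "v_proj"],
        modules_to_save=["embed_tokens", "lm_head"],
    ),
)
merged = peft_model.merge_and_unload()

# Stand-in for training: in a real run, fine-tuning drives the two copies apart.
in_w = merged.get_input_embeddings().weight
out_w = merged.get_output_embeddings().weight
with torch.no_grad():
    out_w.add_(0.01)

print(torch.equal(in_w, out_w))           # False: the weights have diverged
print(merged.config.tie_word_embeddings)  # True: the config still claims they are tied

merged.save_pretrained("merged-model")    # before this PR, the stale flag lands in config.json
```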

I ran make fix-repo and performed the following related tests:
• test_save_pretrained_auto_fixes_diverged_tied_embeddings (new test; sketched below)
• test_tied_weights_are_not_tied_if_both_present_but_different (load side)
• test_tied_weights_are_tied_if_both_present_and_similar
• test_tied_weights_are_always_tied_from_config

The CI tests also pass.
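
For reference, the new test asserts roughly the following (simplified; the actual test uses the library's tiny test models and helpers):

```python
import json
import os
import tempfile

import torch
from transformers import LlamaConfig, LlamaForCausalLM


def test_save_pretrained_auto_fixes_diverged_tied_embeddings():
    # Tiny causal LM with tied embeddings.
    config = LlamaConfig(
        vocab_size=64,
        hidden_size=16,
        intermediate_size=32,
        num_hidden_layers=1,
        num_attention_heads=2,
        num_key_value_heads=2,
        tie_word_embeddings=True,
    )
    model = LlamaForCausalLM(config)

    # Manually untie and diverge lm_head, emulating the state left by a PEFT merge.
    with torch.no_grad():
        model.lm_head.weight = torch.nn.Parameter(model.lm_head.weight.clone() + 0.01)

    with tempfile.TemporaryDirectory() as tmp:
        model.save_pretrained(tmp)
        with open(os.path.join(tmp, "config.json")) as f:
            saved = json.load(f)
        # With this PR, the saved config no longer claims the weights are tied.
        assert saved.get("tie_word_embeddings", False) is False
```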

Fixes #45127

Code Agent Policy

The Transformers repo is currently being overwhelmed by a large number of PRs and issue comments written by
code agents. We are currently bottlenecked by our ability to review and respond to them. As a result,
we ask that new users do not submit pure code agent PRs at this time.
You may use code agents in drafting or to help you diagnose issues. We'd also ask autonomous "OpenClaw"-like agents
not to open any PRs or issues for the moment.

PRs that appear to be fully agent-written will probably be closed without review, and we may block users who do this
repeatedly or maliciously.

This is a rapidly-evolving situation that's causing significant shockwaves in the open-source community. As a result,
this policy is likely to be updated regularly in the near future. For more information, please read CONTRIBUTING.md.

  • I confirm that this is not a pure code agent PR.

I used multiple AI models (Gemini, Claude, Kimi) to cross-validate edge cases and boundary conditions; different models behave differently around tied embeddings, which made CI failures harder to predict than expected. AI helped me locate these edge cases, and I verified they weren't hallucinations.

I have read CONTRIBUTING.md, and tried my best to follow the instructions therein.

Before submitting

Who can review?

@Cyrilvallez @BenjaminBossan

Cursx changed the title from "Fix issue" to "Fix save_pretrained writing incorrect tie_word_embeddings=True config after PEFT merge" on Apr 1, 2026

github-actions Bot commented Apr 1, 2026

View the CircleCI Test Summary for this PR:

https://huggingface.co/spaces/transformers-community/circle-ci-viz?pr=45156&sha=440822



Development

Successfully merging this pull request may close these issues.

[Bug] Model collapse after merging LoRA with extended vocabulary on models with tie_word_embeddings=True (e.g., Qwen2.5 0.5B)
