
Conversation

@lizexu123 lizexu123 commented Jul 7, 2025

Fixed the garbled-output issue in Qwen3-8B. Previously tie_word_embeddings was always treated as True; this change also handles the case where tie_word_embeddings is False, and adds column-split (column-parallel) loading for the lm_head.weight weight. After the fix, Qwen3-8B accuracy is correct.
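The two cases the fix distinguishes can be sketched as follows. This is an illustrative standalone example, not FastDeploy's actual code: the function name `load_lm_head` and its signature are hypothetical, and numpy stands in for Paddle tensors. When the embeddings are tied, the lm_head reuses the embedding table; otherwise `lm_head.weight` is popped from the checkpoint, and in both cases the weight is column-split over the vocab dimension across tensor-parallel ranks.

```python
# Illustrative sketch only (hypothetical names, numpy in place of Paddle).
import numpy as np

def load_lm_head(state_dict, embed_weight, tie_word_embeddings,
                 tp_rank=0, tp_size=1):
    """Return this rank's lm_head weight shard.

    - tied: reuse the embedding weight (transposed to [hidden, vocab]).
    - untied: pop lm_head.weight from the checkpoint.
    Either way, column-split the [hidden, vocab] matrix over ranks.
    """
    if tie_word_embeddings:
        # Weight sharing: lm_head reuses the embedding table.
        full = embed_weight.T  # [hidden_size, vocab_size]
    else:
        full = state_dict.pop("lm_head.weight")
    # Column-parallel split over the vocab dimension.
    shard = np.split(full, tp_size, axis=-1)[tp_rank]
    return shard

# Toy check: vocab=8, hidden=4, two tensor-parallel ranks.
vocab, hidden, tp = 8, 4, 2
sd = {"lm_head.weight":
      np.arange(hidden * vocab, dtype=np.float32).reshape(hidden, vocab)}
emb = np.ones((vocab, hidden), dtype=np.float32)

untied = load_lm_head(dict(sd), emb, tie_word_embeddings=False,
                      tp_rank=0, tp_size=tp)
tied = load_lm_head(dict(sd), emb, tie_word_embeddings=True,
                    tp_rank=1, tp_size=tp)
print(untied.shape, tied.shape)  # → (4, 4) (4, 4)
```

Treating tie_word_embeddings as always True is exactly what produced garbage for checkpoints that ship a separate, untied lm_head.weight: the model then decoded with the wrong output projection.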

@CLAassistant
CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

@ming1753 ming1753 closed this Jul 7, 2025
@ming1753 ming1753 reopened this Jul 7, 2025
@Jiang-Jia-Jun Jiang-Jia-Jun requested a review from yuanlehome July 7, 2025 13:42
@lizexu123 lizexu123 changed the title fix qwen3.py [Bug fix] fix qwen3.py Jul 7, 2025
@lizexu123 lizexu123 changed the title [Bug fix] fix qwen3.py [Bug fix] Fixed the garbled text issues in Qwen3-8B Jul 7, 2025
else:
    if self.tie_word_embeddings:
        self.out_linear.weight.set_value(
            get_tensor(state_dict.pop(self.linear_weight_key)).astype(
Collaborator

This needs to be restored.

Comment on lines 247 to 248
prefix="lm_head",
)
Collaborator

format code

yuanlehome
yuanlehome previously approved these changes Jul 7, 2025
Jiang-Jia-Jun
Jiang-Jia-Jun previously approved these changes Jul 8, 2025
    embedding_dim=fd_config.model_config.hidden_size,
    num_embeddings=fd_config.model_config.vocab_size,
    prefix=(f"{fd_config.model_config.prefix_name}.embed_tokens"),
    prefix="lm_head",
Collaborator

Is the prefix here the same whether tie_word_embeddings is True or False? It looks like line 267 does not use this prefix when it is True?

Collaborator Author

The previous commit did distinguish the two cases. yuanlehome suggested following the approach in ernie4_5_moe.py: below set_state_dict, when the weights are shared, set the lm_head out_linear weight to the embedding layer's weight. I tested Qwen3-0.6B locally and the accuracy is correct.
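The pattern described above can be sketched like this. It is modeled loosely on the ernie4_5_moe.py approach the reviewer mentions; the class and attribute names (`TinyQwen3`, `lm_head_weight`) are illustrative, not the real FastDeploy API, and numpy stands in for Paddle tensors.

```python
# Hedged sketch of weight sharing inside set_state_dict (hypothetical names).
import numpy as np

class TinyQwen3:
    """Toy model holding an embedding table and an lm_head weight."""

    def __init__(self, vocab=8, hidden=4, tie_word_embeddings=True):
        self.tie_word_embeddings = tie_word_embeddings
        self.embed_weight = np.zeros((vocab, hidden), dtype=np.float32)
        self.lm_head_weight = None  # stand-in for lm_head.out_linear.weight

    def set_state_dict(self, state_dict):
        # Load the embedding table first.
        self.embed_weight = state_dict.pop("embed_tokens.weight")
        if self.tie_word_embeddings:
            # Weight sharing: reuse the embedding table instead of
            # expecting a separate lm_head.weight in the checkpoint.
            self.lm_head_weight = self.embed_weight.T
        else:
            self.lm_head_weight = state_dict.pop("lm_head.weight")

# Tied checkpoint: no lm_head.weight key at all.
tied_sd = {"embed_tokens.weight": np.ones((8, 4), dtype=np.float32)}
m = TinyQwen3(tie_word_embeddings=True)
m.set_state_dict(tied_sd)
print(m.lm_head_weight.shape)  # → (4, 8)
```

The design point is that tied checkpoints often omit `lm_head.weight` entirely, so the sharing must happen after the embedding weights are set rather than during lm_head loading.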

@lizexu123 lizexu123 dismissed stale reviews from Jiang-Jia-Jun and yuanlehome via acc9889 July 8, 2025 02:38
@lizexu123 lizexu123 closed this Jul 8, 2025
@lizexu123 lizexu123 reopened this Jul 8, 2025
@yuanlehome yuanlehome merged commit 525be24 into PaddlePaddle:develop Jul 8, 2025
2 of 3 checks passed