Qwen2: assume tied weights if lm_head/output weights is missing by jklj077 · Pull Request #6738 · ggml-org/llama.cpp

jklj077 · 2024-04-18T09:55:54Z

This PR adds the proper support of Qwen2-0.5B, which uses tied word embeddings. Example config: https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat/blob/main/config.json#L21.

Previous attempt: #6578

…l-org#6738)

Qwen2: assume tied weights if lm_head/output weights is missing

c7ab76e

slaren approved these changes Apr 18, 2024

View reviewed changes

ggerganov merged commit e11b2e6 into ggml-org:master Apr 18, 2024

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

Qwen2 : assume tied weights if lm_head/output weights is missing (ggm…

2a0db04

…l-org#6738)

phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026

Qwen2 : assume tied weights if lm_head/output weights is missing (ggm…

e68489b

…l-org#6738)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Qwen2: assume tied weights if lm_head/output weights is missing#6738

Qwen2: assume tied weights if lm_head/output weights is missing#6738
ggerganov merged 1 commit intoggml-org:masterfrom
jklj077:fix-qwen2-0.5b

jklj077 commented Apr 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jklj077 commented Apr 18, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants