Qwen2: assume tied weights if lm_head/output weights is missing #6738

Merged
ggerganov merged 1 commit into ggml-org:master from jklj077:fix-qwen2-0.5b on Apr 18, 2024
@jklj077 (Contributor) commented on Apr 18, 2024

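This PR adds proper support for Qwen2-0.5B, which uses tied word embeddings: if the lm_head/output weights are missing from the model, they are assumed to be tied to the token embedding weights. Example config: https://huggingface.co/Qwen/Qwen1.5-0.5B-Chat/blob/main/config.json#L21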
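
For readers unfamiliar with tied embeddings, here is a minimal sketch of the fallback idea in Python. It is not the code from this PR: the function `resolve_output_weight`, the plain-dict tensor store, and the tensor names `output.weight` and `token_embd.weight` are assumptions chosen for illustration only.

```python
from typing import Dict, List

# Stand-in for a real tensor type; a nested list is enough for illustration.
Tensor = List[List[float]]


def resolve_output_weight(
    tensors: Dict[str, Tensor],
    output_name: str = "output.weight",     # hypothetical name of the lm_head tensor
    embd_name: str = "token_embd.weight",   # hypothetical name of the embedding tensor
) -> Tensor:
    """Return the output projection weights.

    If the dedicated output tensor is missing, assume the weights are tied
    and reuse the token embedding tensor instead.
    """
    output = tensors.get(output_name)
    if output is not None:
        # The checkpoint ships its own output weights: use them as-is.
        return output
    if embd_name not in tensors:
        # Nothing to fall back to, so loading cannot proceed.
        raise KeyError(f"neither {output_name!r} nor {embd_name!r} is present")
    # Tied case: the same tensor object serves both as embedding and output.
    return tensors[embd_name]


if __name__ == "__main__":
    embedding = [[1.0, 2.0], [3.0, 4.0]]
    tied_model = {"token_embd.weight": embedding}
    # With no output tensor stored, the embedding is reused (same object).
    print(resolve_output_weight(tied_model) is embedding)
```

The fallback returns the same object rather than a copy, which mirrors the point of weight tying: one set of parameters is shared between the input embedding and the output projection.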

Previous attempt: #6578

@ggerganov ggerganov merged commit e11b2e6 into ggml-org:master Apr 18, 2024
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
