
llama : use n_embd_head_v instead of n_embd_head_k when reshaping kqv #7327

Merged
ggerganov merged 2 commits into ggml-org:master from fairydreaming:llm_build_kqv_fix on May 17, 2024
Conversation

@fairydreaming
Collaborator

When reshaping kqv at the end of llm_build_kqv(), n_embd_head_k is incorrectly used instead of n_embd_head_v to calculate the kqv dimensions.

@mofosyne added the bugfix (fixes an issue or bug) and Review Complexity : Medium (generally requires more time to grok, but manageable by beginner to medium expertise level) labels on May 16, 2024
llama : use n_embd_v_gqa and n_embd_head_v instead of n_embd_k_gqa and n_embd_head_k when making a view of cached value vectors.
@fairydreaming
Collaborator Author

I found another place where variables for key vectors were used for processing value vectors, so I added another commit to this PR.

Member

@ggerganov left a comment


Which models are affected by this?

@fairydreaming
Collaborator Author

DeepSeek-V2 needs this, since it has n_embd_head_k != n_embd_head_v; I'm not sure about other models:

llm_load_print_meta: n_embd_head_k    = 192
llm_load_print_meta: n_embd_head_v    = 128

@ggerganov merged commit 27b0406 into ggml-org:master on May 17, 2024
@fairydreaming fairydreaming deleted the llm_build_kqv_fix branch March 22, 2025 17:50
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
* llama : use n_embd_head_v instead of n_embd_head_k when reshaping kqv

* llama : use n_embd_v_gqa and n_embd_head_v instead of n_embd_k_gqa and n_embd_head_k when making a view of cached value vectors.

---------

Co-authored-by: Stanisław Szymczyk <sszymczy@gmail.com>
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026

Labels

bugfix (fixes an issue or bug) · Review Complexity : Medium (generally requires more time to grok, but manageable by beginner to medium expertise level)

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants