Skip to content

vulkan: fix shmem overrun in mmq id shader#16873

Merged
0cc4m merged 2 commits intomasterfrom
0cc4m/vulkan-workaround-mmqid-shmem-bug
Oct 31, 2025
Merged

vulkan: fix shmem overrun in mmq id shader#16873
0cc4m merged 2 commits intomasterfrom
0cc4m/vulkan-workaround-mmqid-shmem-bug

Conversation

@0cc4m
Copy link
Copy Markdown
Contributor

@0cc4m 0cc4m commented Oct 30, 2025

This issue caused incoherence in gpt-oss. I forgot to update the shared memory limits for the mmq id shader, and it looks like it overran the limit. I'll create a proper solution later, this change resolves the incoherence for now.

@0cc4m 0cc4m force-pushed the 0cc4m/vulkan-workaround-mmqid-shmem-bug branch from 5ad117d to c9d7147 Compare October 30, 2025 19:18
@0cc4m 0cc4m requested a review from slaren as a code owner October 30, 2025 19:18
@0cc4m 0cc4m requested a review from jeffbolznv October 30, 2025 19:18
Comment thread ggml/src/ggml-vulkan/vulkan-shaders/mul_mmq_shmem_types.glsl
@slaren
Copy link
Copy Markdown
Member

slaren commented Oct 30, 2025

@ggerganov Metal seems to fail this case as well.

@ggerganov ggerganov self-requested a review as a code owner October 30, 2025 21:07
@github-actions github-actions Bot added testing Everything test related Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning Apple Metal https://en.wikipedia.org/wiki/Metal_(API) labels Oct 30, 2025
LostRuins added a commit to LostRuins/koboldcpp that referenced this pull request Oct 31, 2025
@0cc4m 0cc4m merged commit d2a2673 into master Oct 31, 2025
70 of 72 checks passed
@0cc4m 0cc4m deleted the 0cc4m/vulkan-workaround-mmqid-shmem-bug branch October 31, 2025 07:14
Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026
* vulkan: fix shmem overrun in mmq id shader

* metal : fix mul_mm_id

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026
* vulkan: fix shmem overrun in mmq id shader

* metal : fix mul_mm_id

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
* vulkan: fix shmem overrun in mmq id shader

* metal : fix mul_mm_id

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants