Skip to content

CUDA: fix compilation on CC 6.0#16091

Merged
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fa-tile-fix-compile
Sep 18, 2025
Merged

CUDA: fix compilation on CC 6.0#16091
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fa-tile-fix-compile

Conversation

@JohannesGaessler
Copy link
Copy Markdown
Contributor

Fixes #16081 . The problem is that the Python script I used to determine shared memory use neglected one small part and I didn't notice because the compilation only fails on trying to generate a real architecture.

@github-actions github-actions Bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Sep 18, 2025
@JohannesGaessler JohannesGaessler merged commit 368560a into ggml-org:master Sep 18, 2025
54 of 55 checks passed
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Sep 27, 2025
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Sep 29, 2025
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Oct 4, 2025
pwilkin pushed a commit to pwilkin/llama.cpp that referenced this pull request Oct 23, 2025
blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Compile bug: CUDA build uses too much shared data

2 participants