CUDA: static assert to prevent misuse of memcpy_1 #17198

Merged
JohannesGaessler merged 1 commit into ggml-org:master from JohannesGaessler:cuda-memcpy_1-assert
Nov 12, 2025

@JohannesGaessler
Contributor

Adds a static assert to prevent ggml_cuda_memcpy_1 from being used in a way that is not intended. The purpose of the function is to do properly coalesced and aligned copies. If copies larger than the maximum number of bytes per instruction are allowed, that will complicate possible future changes.
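
To illustrate the idea, here is a minimal host-side C++ sketch of a compile-time size guard on a templated copy helper. It is not the code from this PR: `max_copy_bytes`, `copy_once`, and `copy_chunked` are hypothetical names, the 16-byte limit is an assumed value, and `std::memcpy` stands in for the device-side copy instruction.

```cpp
#include <cstring>

// Hypothetical stand-in for the maximum number of bytes one copy
// instruction can move; 16 is an assumed value for illustration.
constexpr int max_copy_bytes() { return 16; }

// Copies exactly nbytes in one step. The static_assert rejects, at
// compile time, any instantiation that asks for more than one
// instruction's worth of data.
template <int nbytes>
inline void copy_once(void * dst, const void * src) {
    static_assert(nbytes > 0, "nbytes must be positive");
    static_assert(nbytes <= max_copy_bytes(),
                  "copy exceeds the per-instruction limit; call copy_once in a loop instead");
    std::memcpy(dst, src, nbytes);
}

// Larger copies are expressed as an explicit loop over chunks that
// each respect the limit, rather than as one oversized call.
template <int total_bytes>
inline void copy_chunked(void * dst, const void * src) {
    static_assert(total_bytes % max_copy_bytes() == 0,
                  "total_bytes must be a multiple of the chunk size");
    for (int off = 0; off < total_bytes; off += max_copy_bytes()) {
        copy_once<max_copy_bytes()>(static_cast<char *>(dst) + off,
                                    static_cast<const char *>(src) + off);
    }
}
```

With this guard, `copy_once<32>(dst, src)` would fail to compile under the assumed 16-byte limit, while `copy_chunked<32>(dst, src)` compiles and performs two 16-byte copies.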

@github-actions (bot) added the labels "Nvidia GPU" (issues specific to Nvidia GPUs) and "ggml" (changes relating to the ggml tensor library for machine learning) on Nov 12, 2025
@JohannesGaessler merged commit 5d6838b into ggml-org:master on Nov 12, 2025
60 of 61 checks passed
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Nov 24, 2025
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Nov 28, 2025
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Dec 1, 2025
Nexesenex added a commit to Nexesenex/croco.cpp that referenced this pull request Dec 20, 2025
Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

Labels

ggml: changes relating to the ggml tensor library for machine learning
Nvidia GPU: issues specific to Nvidia GPUs