
CUDA: fix LoRAs #3130

Merged
JohannesGaessler merged 1 commit into ggml-org:master from JohannesGaessler:cuda-fix-lora
Sep 12, 2023

@JohannesGaessler
Contributor

As pointed out in #3110 (comment), the CUDA code for LoRAs was broken by #3110. This PR fixes it.

Comment thread ggml-cuda.cu Outdated
@JohannesGaessler
Contributor Author

Previously, ggml_cpy_tensor_2d was not called for LoRAs. As it turns out, the logic for src0_on_device was incorrect, so the tensor was being copied unnecessarily. I'm keeping the extended logic for ggml_cpy_tensor_2d.
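
For readers without the diff open, here is a rough sketch of the general idea behind an "on device" check: if the source tensor's data already lives in GPU memory, the host-to-device copy can be skipped. This is not the code from ggml-cuda.cu. The `Backend` enum, the `Tensor` struct, and the `on_device` and `bytes_to_upload` helpers are hypothetical stand-ins invented for illustration, and the exact condition that this PR corrected is not reproduced here.

```cpp
#include <cassert>
#include <cstddef>

// Hypothetical stand-ins for illustration only; these are NOT the real ggml types.
enum class Backend { CPU, GPU, GPU_SPLIT };

struct Tensor {
    Backend     backend; // where the tensor's data currently lives
    std::size_t nbytes;  // size of the tensor's data in bytes
};

// True when the tensor's data is already resident in device memory
// (assumption: both GPU and GPU_SPLIT count as resident).
inline bool on_device(const Tensor & t) {
    return t.backend == Backend::GPU || t.backend == Backend::GPU_SPLIT;
}

// Number of bytes that would have to be uploaded host -> device before a kernel
// can read src0. A wrong on_device() result makes this nonzero for data that is
// already on the GPU, which is the kind of unnecessary copy described above.
inline std::size_t bytes_to_upload(const Tensor & src0) {
    assert(src0.nbytes > 0);
    return on_device(src0) ? 0 : src0.nbytes;
}
```

With this sketch, a tensor tagged `Backend::CPU` reports its full size as pending upload, while one tagged `Backend::GPU` or `Backend::GPU_SPLIT` reports zero.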

@JohannesGaessler JohannesGaessler merged commit 4f7cd6b into ggml-org:master Sep 12, 2023
pkrmf pushed a commit to morlockstudios-com/llama.cpp that referenced this pull request Sep 26, 2023
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026