CUDA: fix LoRAs by JohannesGaessler · Pull Request #3130 · ggml-org/llama.cpp

JohannesGaessler · 2023-09-11T22:18:28Z

As pointed out by #3110 (comment) , the CUDA code for LoRAs was broken by #3110 . This PR fixes this.

JohannesGaessler · 2023-09-12T08:13:58Z

Previously ggml_cpy_tensor_2d was not called for LoRAs. As it turns out the logic for src0_on_device was incorrect so the tensor was being copied unnecessarily. I'm keeping the extended logic for ggml_cpy_tensor_2d.

slaren reviewed Sep 11, 2023

View reviewed changes

Comment thread ggml-cuda.cu Outdated

JohannesGaessler force-pushed the cuda-fix-lora branch from 7f52cb5 to f97d546 Compare September 11, 2023 22:45

slaren approved these changes Sep 11, 2023

View reviewed changes

CUDA: fix LoRAs

f866663

JohannesGaessler force-pushed the cuda-fix-lora branch from f97d546 to f866663 Compare September 12, 2023 08:11

JohannesGaessler merged commit 4f7cd6b into ggml-org:master Sep 12, 2023

pkrmf pushed a commit to morlockstudios-com/llama.cpp that referenced this pull request Sep 26, 2023

CUDA: fix LoRAs (ggml-org#3130)

b958c90

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

CUDA: fix LoRAs (ggml-org#3130)

d86076e

phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026

CUDA: fix LoRAs (ggml-org#3130)

99793f9

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CUDA: fix LoRAs#3130

CUDA: fix LoRAs#3130
JohannesGaessler merged 1 commit intoggml-org:masterfrom
JohannesGaessler:cuda-fix-lora

JohannesGaessler commented Sep 11, 2023

Uh oh!

Uh oh!

JohannesGaessler commented Sep 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

JohannesGaessler commented Sep 11, 2023

Uh oh!

Uh oh!

JohannesGaessler commented Sep 12, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants