Q1_0: port CUDA kernels by pwilkin · Pull Request #21584 · ggml-org/llama.cpp

pwilkin · 2026-04-07T20:37:02Z

Overview

CUDA kernels for Q1_0 ported from the original fork.

Additional information

The CUDA kernels from the original fork's Q1_0_g128.

Requirements

I have read and agree with the contributing guidelines
AI usage disclosure: Yes, used Kimi 2.5 to compare the branch and port the kernels

am17an · 2026-04-08T01:07:42Z

I think the original authors already had planned CUDA kernels. If so, we should let them add them

pwilkin · 2026-04-08T10:49:32Z

@khosravipasha can you please take a look and comment?

khosravipasha · 2026-04-08T14:53:50Z

Oh thanks, did not see this just submitted the CUDA PR: #21629
was waiting for Metal backend to get merged.
I might need some help with tuning the kernels not a GPU expert myself, but so far the speed ups were satisfactory.

pwilkin · 2026-04-08T15:08:03Z

Ah, no worries :)

Obsoleted by #21629

port kernels from original fork

b434dc2

pwilkin requested a review from a team as a code owner April 7, 2026 20:37

pwilkin mentioned this pull request Apr 7, 2026

Eval bug: Very slow inference of Q1_0 Bonsai model #21574

Open

github-actions Bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Apr 8, 2026

pwilkin closed this Apr 8, 2026

Marxist-Leninist mentioned this pull request Apr 10, 2026

(Performance) Optimized x86 and generic q1_0(_g128) dot PrismML-Eng/llama.cpp#10

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Q1_0: port CUDA kernels#21584

Q1_0: port CUDA kernels#21584
pwilkin wants to merge 1 commit intoggml-org:masterfrom
pwilkin:q1-cuda-kernels

pwilkin commented Apr 7, 2026

Uh oh!

am17an commented Apr 8, 2026

Uh oh!

pwilkin commented Apr 8, 2026

Uh oh!

khosravipasha commented Apr 8, 2026

Uh oh!

pwilkin commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

pwilkin commented Apr 7, 2026

Overview

Additional information

Requirements

Uh oh!

am17an commented Apr 8, 2026

Uh oh!

pwilkin commented Apr 8, 2026

Uh oh!

khosravipasha commented Apr 8, 2026

Uh oh!

pwilkin commented Apr 8, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants