CUDA: revert part of the RDNA1 optimizations#8309

Merged
JohannesGaessler merged 1 commit into ggml-org:master from daniandtheweb:gfx1010_optimizations
Jul 5, 2024

Conversation

@daniandtheweb
Contributor

@daniandtheweb daniandtheweb commented Jul 4, 2024

The change to launch_bounds was causing a small performance drop in prompt processing; apparently that change was only beneficial before I tuned the mmq_y values.

| model | size | params | backend | ngl | test | t/s master | t/s PR | Speedup |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| llama 8B Q5_K - Small | 5.21 GiB | 8.03 B | ROCm | 99 | pp512 | 276.60 ± 0.41 | 300.60 ± 0.46 | 1.09 |
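For context, `__launch_bounds__` is the CUDA/HIP kernel attribute whose tuning this PR partially reverts. A minimal illustrative sketch of how it is applied (an assumption for illustration only, not the actual mul_mat_q kernel from llama.cpp):

```cuda
// Illustrative sketch, not llama.cpp's real kernel.
// __launch_bounds__(maxThreadsPerBlock, minBlocksPerMultiprocessor) tells the
// compiler the maximum block size the kernel will be launched with and a
// desired minimum number of resident blocks per SM/CU. This changes register
// allocation, and therefore occupancy and performance, per architecture.
__global__ void __launch_bounds__(256, 2)
scale_f32(float *x, const float scale, const int n) {
    const int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) {
        x[i] *= scale;
    }
}
```

Because the second argument trades registers per thread for occupancy, a bound that helps one tile size (mmq_y value) can hurt another, which is consistent with the regression described above.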

The change on the launch_bounds was causing a small performance drop in perplexity of 25 t/s
@github-actions github-actions Bot added the Nvidia GPU Issues specific to Nvidia GPUs label Jul 4, 2024
@JohannesGaessler JohannesGaessler merged commit 0a42380 into ggml-org:master Jul 5, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 13, 2024
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
