ggml: WebGPU disable SET_ROWS for now by reeselevine · Pull Request #15078 · ggml-org/llama.cpp

reeselevine · 2025-08-05T00:42:47Z

test-thread-safety was recently updated to use SET_ROWS by default, but the WebGPU backend doesn't support it yet. I'm aiming to add support and open a PR in the next couple days, but in the meantime disabling SET_ROWS for the CI so the WebGPU tests pass.

…building/submission

slaren · 2025-08-05T00:47:59Z

Looks like it is still failing.

reeselevine · 2025-08-05T04:26:44Z

Sorry this CI failure for WebGPU on the Linux machine is turning out to be trickier than I expected. I haven't been able to reproduce it locally yet, only on the Github action runners using the simulated Vulkan LLVMpipe backend.

I'll keep working on it this week, to see if I can get a definitive answer into what's going on. In the meantime, would it make sense to just disable the WebGPU CI so it doesn't clutter up other PRs?

reeselevine · 2025-08-05T20:06:19Z

Looks like the CI is passing now, I was able to debug on the Github action runner using this: https://github.com/mxschmitt/action-tmate.

I believe the issue was due to not blocking on set_tensor calls. Not sure why it only causes issues on the LLVMpipe backend, but I suppose it's good that the CI caught the issue!

I also made another minor changes in this PR, to explicitly wait on Futures returned by WebGPU API callbacks before returning from graph_compute.

A few of the macOS CI tests are still queued, I'll wait for them to complete successfully before merging. Hopefully the WebGPU CI is more stable from here on!

* Add paramater buffer pool, batching of submissions, refactor command building/submission * Add header for linux builds * Free staged parameter buffers at once * Format with clang-format * Fix thread-safe implementation * Use device implicit synchronization * Update workflow to use custom release * Remove testing branch workflow * Disable set_rows until it's implemented * Fix potential issue around empty queue submission * Try synchronous submission * Try waiting on all futures explicitly * Add debug * Add more debug messages * Work on getting ssh access for debugging * Debug on failure * Disable other tests * Remove extra if * Try more locking * maybe passes? * test * Some cleanups * Restore build file * Remove extra testing branch ci

reeselevine added 11 commits July 30, 2025 12:33

Add paramater buffer pool, batching of submissions, refactor command …

30ba139

…building/submission

Add header for linux builds

04d7b27

Free staged parameter buffers at once

01c8ced

Format with clang-format

bfff27f

Fix thread-safe implementation

b8012ec

Use device implicit synchronization

cddda7e

Merge remote-tracking branch 'upstream/master' into fixes

1d5726a

Update workflow to use custom release

6a20e39

Remove testing branch workflow

ea39068

Merge branch 'ggml-org:master' into master

4c58742

Disable set_rows until it's implemented

ae8edbf

github-actions Bot added the devops improvements to build systems and github actions label Aug 5, 2025

slaren approved these changes Aug 5, 2025

View reviewed changes

reeselevine added 2 commits August 4, 2025 18:35

Merge branch 'ggml-org:master' into master

75eb99b

Fix potential issue around empty queue submission

bfc6930

github-actions Bot added the ggml changes relating to the ggml tensor library for machine learning label Aug 5, 2025

reeselevine added 4 commits August 4, 2025 20:35

Try synchronous submission

69965a8

Try waiting on all futures explicitly

c773e2f

Add debug

5aeab73

Add more debug messages

d4af0d6

0cc4m reviewed Aug 5, 2025

View reviewed changes

Comment thread .github/workflows/build.yml Outdated

reeselevine added 7 commits August 5, 2025 07:27

Work on getting ssh access for debugging

320f679

Debug on failure

f422911

Disable other tests

0feece5

Remove extra if

0512d66

Try more locking

9335adf

maybe passes?

fc9e99d

test

7d9807e

reeselevine added 3 commits August 5, 2025 11:23

Some cleanups

f7745c4

Restore build file

4dc409a

Remove extra testing branch ci

3b81c99

reeselevine merged commit 9515c61 into ggml-org:master Aug 5, 2025
47 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml: WebGPU disable SET_ROWS for now#15078

ggml: WebGPU disable SET_ROWS for now#15078
reeselevine merged 27 commits intoggml-org:masterfrom
reeselevine:master

reeselevine commented Aug 5, 2025

Uh oh!

slaren commented Aug 5, 2025

Uh oh!

reeselevine commented Aug 5, 2025

Uh oh!

Uh oh!

reeselevine commented Aug 5, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

reeselevine commented Aug 5, 2025

Uh oh!

slaren commented Aug 5, 2025

Uh oh!

reeselevine commented Aug 5, 2025

Uh oh!

Uh oh!

reeselevine commented Aug 5, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants