
Upgrade init_tensor API to return a ggml_status #11854

Merged
slaren merged 2 commits into ggml-org:master from WilliamTambellini:init_tensor
Feb 28, 2025

Conversation

@WilliamTambellini
Contributor

To prepare for an 'abort-free' ggml, as agreed with Diego in the ggml repo, upgrade the backend init_tensor APIs to return a ggml_status.


@github-actions github-actions Bot added testing Everything test related Nvidia GPU Issues specific to Nvidia GPUs Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Feb 14, 2025
@WilliamTambellini
Contributor Author

@slaren review please. Thanks.

Comment thread ggml/src/ggml-backend.cpp Outdated
Comment thread ggml/src/ggml-cuda/CMakeLists.txt Outdated
Comment thread ggml/src/ggml-cuda/ggml-cuda.cu Outdated
@WilliamTambellini
Contributor Author

Thanks @slaren.
Ready for review again.

Comment thread ggml/src/ggml-backend.cpp Outdated
Comment thread tests/test-backend-ops.cpp Outdated
Comment thread ggml/src/ggml-backend.cpp Outdated
@WilliamTambellini WilliamTambellini force-pushed the init_tensor branch 4 times, most recently from e2486eb to 51a0f6c Compare February 18, 2025 23:48
graehl

This comment was marked as outdated.

Contributor

@graehl left a comment


OK, so ggml_backend_*_buffer_init_tensor can only return success for most backends, but since it is called through the interface init_tensor pointer they still need to return a status. Was the plan to eventually make cuda_init_tensor sometimes return an error?

@WilliamTambellini
Contributor Author

Thanks @graehl

OK, so ggml_backend_*_buffer_init_tensor can only return success for most backends, but since it is called through the interface init_tensor pointer they still need to return a status. Was the plan to eventually make cuda_init_tensor sometimes return an error?

Yes, but that is for another PR, in the ggml repo.

@WilliamTambellini
Contributor Author

@slaren ready for review again, please. Best.

Contributor

@matiaslin left a comment


Good step forward towards the goal of returning an error instead of crashing.

@WilliamTambellini
Copy link
Copy Markdown
Contributor Author

@ggerganov review please

Comment thread ggml/src/ggml-alloc.c Outdated
Comment on lines 959 to 983
Member


This will leak memory if it fails.

Comment thread ggml/src/ggml-backend.cpp Outdated
Comment thread ggml/src/ggml-cuda/CMakeLists.txt Outdated
Comment thread tests/test-backend-ops.cpp Outdated
To prepare for an 'abort-free' ggml
(ggml not to abort on OOMs but return a OOM status),
as agreed with Diego in the ggml repo,
upgrade the init_tensor() and view_init() APIs
to return a ggml_status.
@slaren slaren merged commit 70680c4 into ggml-org:master Feb 28, 2025
@ag2s20150909 ag2s20150909 mentioned this pull request Mar 3, 2025
@WilliamTambellini
Contributor Author

@ggerganov I now have to retouch my PR in ggml. Could you please trigger a sync of ggml from llama.cpp to the ggml repo?

@ggerganov
Member

@WilliamTambellini Should be good now.

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Mar 19, 2025
* Upgrade init_tensor API to return a ggml_status

To prepare for an 'abort-free' ggml
(ggml not to abort on OOMs but return a OOM status),
as agreed with Diego in the ggml repo,
upgrade the init_tensor() and view_init() APIs
to return a ggml_status.

* misc fixes

---------

Co-authored-by: slaren <slarengh@gmail.com>
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
