
rpc : track allocated buffers#7411

Merged
rgerganov merged 2 commits into ggml-org:master from rgerganov:rpc-track-buffers
May 20, 2024

Conversation

@rgerganov
Member

ref: #7407

@rgerganov rgerganov self-assigned this May 20, 2024
@rgerganov rgerganov requested a review from slaren May 20, 2024 09:39
Comment thread ggml-rpc.cpp


ggml_backend_t backend;
std::unordered_set<ggml_backend_buffer_t> buffers;
Contributor

nice work! Just one thought:

  • should we also handle the lifespan of client_socket and ggml_backend_t in this class?

Member Author

As I noted in PR #7407, we serve only one client at a time and I don't see a reason to add support for multiple clients.

Contributor

@chraac chraac May 20, 2024

> As I noted in PR #7407, we serve only one client at a time and I don't see a reason to add support for multiple clients.

Okay, then we can do it later, when we introduce multi-client support.
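The idea discussed in this thread — the server remembers every buffer it allocates for a client and frees whatever is left when the client disconnects — can be sketched as follows. This is a minimal illustration, not the actual `ggml-rpc.cpp` code: `buffer`, `rpc_server_sketch`, and the method names are hypothetical stand-ins for the real `ggml_backend_buffer_t` handling.

```cpp
#include <cstddef>
#include <unordered_set>

// Hypothetical stand-in for ggml_backend_buffer_t.
struct buffer {};
using buffer_t = buffer *;

class rpc_server_sketch {
public:
    // Allocate a buffer on behalf of the client and track ownership.
    buffer_t alloc_buffer() {
        buffer_t buf = new buffer();
        buffers.insert(buf);
        return buf;
    }

    // Free a buffer explicitly requested by the client; erase() returns
    // the number of removed elements, so unknown handles are ignored.
    void free_buffer(buffer_t buf) {
        if (buffers.erase(buf) > 0) {
            delete buf;
        }
    }

    // When the client disconnects, the server object is destroyed and
    // any buffers the client never freed are released here.
    ~rpc_server_sketch() {
        for (buffer_t buf : buffers) {
            delete buf;
        }
    }

    std::size_t tracked() const { return buffers.size(); }

private:
    std::unordered_set<buffer_t> buffers;
};
```

With this scheme, a client that crashes mid-session no longer leaks backend memory on the server: the destructor sweeps whatever the `std::unordered_set` still holds.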

Comment thread ggml-rpc.cpp
}
#endif
GGML_PRINT_DEBUG("Connecting to %s\n", endpoint);
fprintf(stderr, "Connecting to %s\n", endpoint);
Contributor

nit: wouldn't it be better to use GGML_PRINT_DEBUG instead of stdio directly?

Member Author

This message is helpful for troubleshooting connection problems, especially when using multiple rpc-servers, so I think it's fine to always print it.

@mofosyne mofosyne added the Review Complexity : Medium and server labels May 20, 2024
Comment thread ggml-rpc.cpp
@chraac
Contributor

chraac commented May 20, 2024

Tested on my machine; the memory was freed correctly after the client disconnected:

[screenshot: rpc-server output showing free_mem]

As we can see in the picture above, free_mem remains unchanged when I disconnect and connect again.

Comment thread ggml-rpc.cpp
Contributor

@chraac chraac left a comment

lgtm

@rgerganov rgerganov merged commit db10f01 into ggml-org:master May 20, 2024
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
* rpc : track allocated buffers

ref: ggml-org#7407

* rpc : pack rpc_tensor tightly
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
* rpc : track allocated buffers

ref: ggml-org#7407

* rpc : pack rpc_tensor tightly
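The second commit, "rpc : pack rpc_tensor tightly", refers to removing compiler-inserted padding from the struct serialized over the wire, so that its layout is identical on every platform. A hedged sketch of the technique (the field names and sizes here are illustrative, not the actual `rpc_tensor` definition in `ggml-rpc.cpp`):

```cpp
#include <cstdint>

// Force 1-byte alignment so the struct has no padding between
// fields and a fixed, platform-independent wire size.
#pragma pack(push, 1)
struct rpc_tensor_sketch {
    uint64_t id;       // remote tensor identifier (hypothetical field)
    uint32_t type;     // without packing, 4 padding bytes would follow here
    uint64_t ne[4];    // number of elements per dimension
    uint64_t data;     // remote data address
};
#pragma pack(pop)

// 8 + 4 + 4*8 + 8 = 52 bytes; the compile-time check fails if the
// compiler inserted any padding.
static_assert(sizeof(rpc_tensor_sketch) == 52, "rpc_tensor_sketch must be tightly packed");
```

Without the pragma, the compiler would align `ne[0]` to an 8-byte boundary, inflating the struct to 56 bytes and making the serialized layout depend on the target ABI.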

Labels

Review Complexity : Medium · server


5 participants