cuda : display total and free VRAM capacity during device initialization by tehsiuhuang · Pull Request #20185 · ggml-org/llama.cpp

tehsiuhuang · 2026-03-07T07:12:41Z

While running the benchark, realized that no VRAM info and had to check in another console
Helps users verify hardware constraints directly from the log output even when running the benchmark
Get total VRAM via cudaGetDeviceProperties and free memory via cudaMemGetInfo.
Tested on Alienware m16 R2 (RTX 4070 Laptop GPU).

Test Result

llama-bench & test-backend-ops

ctest
1 failure (# 14) not related to my change

tehsiuhuang · 2026-03-09T04:56:59Z

@am17an Thanks for your review!!!

…ion (ggml-org#20185)

cuda : display total and free VRAM capacity during device initialization

0c74ee9

github-actions Bot added Nvidia GPU Issues specific to Nvidia GPUs ggml changes relating to the ggml tensor library for machine learning labels Mar 7, 2026

am17an approved these changes Mar 9, 2026

View reviewed changes

am17an merged commit 5f4cdac into ggml-org:master Mar 9, 2026
78 checks passed

bartowski1182 pushed a commit to bartowski1182/llama.cpp that referenced this pull request Mar 10, 2026

cuda : display total and free VRAM capacity during device initializat…

15b285c

…ion (ggml-org#20185)

BobbyL2k mentioned this pull request Mar 15, 2026

Misc. bug: llama-server router mode uses more VRAM than direct loading #20582

Closed

am17an mentioned this pull request Mar 15, 2026

ggml: avoid creating CUDA context during device init #20595

Merged

Ethan-a2 pushed a commit to Ethan-a2/llama.cpp that referenced this pull request Mar 20, 2026

cuda : display total and free VRAM capacity during device initializat…

5916cdd

…ion (ggml-org#20185)

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

cuda : display total and free VRAM capacity during device initializat…

23bb4ab

…ion (ggml-org#20185)

rsenthilkumar6 pushed a commit to rsenthilkumar6/llama.cpp that referenced this pull request May 1, 2026

cuda : display total and free VRAM capacity during device initializat…

41b0284

…ion (ggml-org#20185)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cuda : display total and free VRAM capacity during device initialization#20185

cuda : display total and free VRAM capacity during device initialization#20185
am17an merged 1 commit intoggml-org:masterfrom
tehsiuhuang:feat/cuda-vram-log

tehsiuhuang commented Mar 7, 2026 •

edited

Loading

Uh oh!

Uh oh!

tehsiuhuang commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

tehsiuhuang commented Mar 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Test Result

Uh oh!

Uh oh!

tehsiuhuang commented Mar 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

tehsiuhuang commented Mar 7, 2026 •

edited

Loading