scripts: add bpw per layer and model #14703
Merged
CISC merged 1 commit into ggml-org:master on Jul 15, 2025
Conversation
CISC approved these changes on Jul 15, 2025
Contributor (Author)
Thank you @CISC
blime4 referenced this pull request in blime4/llama.cpp on Feb 5, 2026
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request on Apr 26, 2026
Since llama-quantize allows users to select a wide range of quant types, it may not always be obvious which weight encoding scheme is the most appropriate to use when following the GGUF naming conventions.

This PR modifies gguf_dump.py to display the bits per weight (bpw) for each layer, and for the overall model, when using the --markdown option.
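For context, bits per weight is just the number of bits a tensor's (quantized) data occupies divided by its element count, and the model-wide figure is taken over the summed totals rather than averaged per layer. Below is a minimal sketch of that calculation, not the exact code added in this PR, assuming the gguf-py GGUFReader API where each entry of reader.tensors exposes n_bytes and n_elements; the model path is hypothetical.

```python
# Sketch only: per-tensor and whole-model bpw from a GGUF file.
from gguf import GGUFReader  # pip install gguf


def dump_bpw(model_path: str) -> None:
    reader = GGUFReader(model_path)
    total_bytes = 0
    total_elements = 0
    for tensor in reader.tensors:
        # bpw = bits occupied by the (quantized) tensor data / number of weights
        bpw = tensor.n_bytes * 8 / tensor.n_elements
        total_bytes += tensor.n_bytes
        total_elements += tensor.n_elements
        print(f"{tensor.name}: {bpw:.2f} bpw")
    # Overall bpw uses the summed totals, not an average of per-tensor values.
    print(f"model: {total_bytes * 8 / total_elements:.2f} bpw")


if __name__ == "__main__":
    dump_bpw("model.gguf")  # hypothetical path
```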