gguf_convert_endian.py: skip MXFP4 data by AlekseiNikiforovIBM · Pull Request #17523 · ggml-org/llama.cpp

AlekseiNikiforovIBM · 2025-11-26T16:02:24Z

MXFP4 data only uses 1-byte data, there's nothing to byteswap.

Also use gguf.constants.GGML_QUANT_SIZES to determine block sizes. Don't duplicate block size constants.

CISC · 2025-11-26T21:58:43Z

Hmmm, won't converting gpt-oss mess up for the same reason? I think GGUFWriter.add_tensor/write_tensor_data needs a check.

AlekseiNikiforovIBM · 2025-11-27T10:28:28Z

Hmmm, won't converting gpt-oss mess up for the same reason? I think GGUFWriter.add_tensor/write_tensor_data needs a check.

byteswap calls in GGUFWriter.add_tensor/write_tensor_data should be already a no-op if data elements are 1-byte-sized. But this conversion script requires explicit handling for each data structure.

CISC · 2025-11-27T10:43:22Z

@AlekseiNikiforovIBM Mind having a look at gguf_new_metadata.py and gguf_editor_gui.py? I have a feeling they might mess up when saving a non-native endian GGUF.

AlekseiNikiforovIBM · 2025-11-27T10:49:56Z

I can take a look at these scripts. Are there any examples on how to use gguf_new_metadata.py?

CISC · 2025-11-27T10:59:58Z

I can take a look at these scripts. Are there any examples on how to use gguf_new_metadata.py?

A very simple test would just be gguf-new-metadata --general-name foo input.gguf output.gguf

* gguf_convert_endian.py: skip MXFP4 data * Use gguf.constants.GGML_QUANT_SIZES to determine block sizes

AlekseiNikiforovIBM added 2 commits November 26, 2025 17:00

gguf_convert_endian.py: skip MXFP4 data

810f257

Use gguf.constants.GGML_QUANT_SIZES to determine block sizes

8545c14

AlekseiNikiforovIBM requested a review from CISC as a code owner November 26, 2025 16:02

github-actions bot added the python python script changes label Nov 26, 2025

loci-dev mentioned this pull request Nov 26, 2025

UPSTREAM PR #17523: gguf_convert_endian.py: skip MXFP4 data auroralabs-loci/llama.cpp#335

Open

CISC approved these changes Nov 26, 2025

View reviewed changes

taronaeo approved these changes Nov 27, 2025

View reviewed changes

CISC merged commit 4fcd87c into ggml-org:master Nov 27, 2025
4 of 5 checks passed

Anico2 added a commit to Anico2/llama.cpp that referenced this pull request Jan 15, 2026

gguf-py : skip endian-conversion of MXFP4 data (ggml-org#17523)

8512de7

* gguf_convert_endian.py: skip MXFP4 data * Use gguf.constants.GGML_QUANT_SIZES to determine block sizes

blime4 referenced this pull request in blime4/llama.cpp Feb 5, 2026

gguf-py : skip endian-conversion of MXFP4 data (#17523)

d9e4661

* gguf_convert_endian.py: skip MXFP4 data * Use gguf.constants.GGML_QUANT_SIZES to determine block sizes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gguf_convert_endian.py: skip MXFP4 data#17523

gguf_convert_endian.py: skip MXFP4 data#17523
CISC merged 2 commits intoggml-org:masterfrom
AlekseiNikiforovIBM:byteswap_mxfp4

AlekseiNikiforovIBM commented Nov 26, 2025

Uh oh!

CISC commented Nov 26, 2025

Uh oh!

AlekseiNikiforovIBM commented Nov 27, 2025

Uh oh!

Uh oh!

CISC commented Nov 27, 2025

Uh oh!

AlekseiNikiforovIBM commented Nov 27, 2025

Uh oh!

CISC commented Nov 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AlekseiNikiforovIBM commented Nov 26, 2025

Uh oh!

CISC commented Nov 26, 2025

Uh oh!

AlekseiNikiforovIBM commented Nov 27, 2025

Uh oh!

Uh oh!

CISC commented Nov 27, 2025

Uh oh!

AlekseiNikiforovIBM commented Nov 27, 2025

Uh oh!

CISC commented Nov 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants