Improve ability to convert safetensors files. #1276

Merged

prusnak merged 2 commits into ggml-org:master from ubik2:master
May 8, 2023
Conversation

@ubik2 (Contributor) commented May 2, 2023

When loading a safetensors file, ignore the metadata header.
If no pt or pth files are available, attempt to load safetensors files.
Edit: This has been changed to try to load safetensors files first, and only load the pt/pth/bin file if those aren't available.
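For context, a safetensors file starts with an 8-byte little-endian length followed by a JSON header, and the optional `__metadata__` entry in that header is the free-form metadata this PR ignores. Below is a minimal sketch of reading the header and dropping that entry; the function and tensor names are illustrative, not the actual convert.py code:

```python
import json
import struct

def read_safetensors_header(data: bytes) -> dict:
    # First 8 bytes: little-endian u64 giving the JSON header length.
    (header_len,) = struct.unpack("<Q", data[:8])
    header = json.loads(data[8 : 8 + header_len].decode("utf-8"))
    # The optional "__metadata__" entry carries free-form strings;
    # a converter can drop it and keep only the tensor entries.
    header.pop("__metadata__", None)
    return header

# Build a minimal in-memory safetensors-like blob for demonstration
# (the tensor name and shape are made up).
entries = {
    "__metadata__": {"format": "pt"},
    "tok_embeddings.weight": {"dtype": "F32", "shape": [2, 2],
                              "data_offsets": [0, 16]},
}
blob = json.dumps(entries).encode("utf-8")
payload = struct.pack("<Q", len(blob)) + blob + b"\x00" * 16
print(sorted(read_safetensors_header(payload)))
```

The tensor data itself follows the header and is untouched here; only the metadata entry is stripped before iterating over tensors.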

… or pth files are available, attempt to load safetensors files
Comment thread on convert.py (outdated):
files = [file for glob in globs for file in path.glob(glob)]
if not files:
# Check if it's a set of safetensors files
globs = ["model-00001-of-*.safetensors"]
Contributor:

Why not just add "model-00001-of-*.safetensors" to the globs above?

@ubik2 (Contributor, Author) replied May 4, 2023:

That would generally work as well. I was thinking of treating it the same as the ggml files, at lower priority than the pth files, in case both exist. If I simply add the safetensors pattern to the globs above and both kinds of files are present, the convert script will abort, since it can't tell which one to use.
In any case, I'm happy to change it if you prefer.
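The priority scheme being discussed can be sketched as ordered glob groups: try the safetensors patterns first, fall back to PyTorch checkpoints, and abort only when patterns within a single group match more than one file. This is a hypothetical illustration of that logic, not the actual convert.py implementation; the patterns and names are assumptions:

```python
from pathlib import Path

def find_model_file(path: Path) -> Path:
    # Glob groups in priority order (patterns here are illustrative).
    glob_groups = [
        ["model-00001-of-*.safetensors", "model.safetensors"],  # preferred
        ["consolidated.00.pth", "pytorch_model.bin"],           # fallback
    ]
    for globs in glob_groups:
        files = [f for g in globs for f in path.glob(g)]
        if len(files) > 1:
            # Multiple matches within one priority group are ambiguous,
            # so abort rather than guess which checkpoint to convert.
            raise RuntimeError(f"ambiguous model files: {files}")
        if files:
            return files[0]
    raise FileNotFoundError(f"no model file found in {path}")
```

With separate groups, having both a .pth and a .safetensors file is no longer ambiguous: the safetensors file simply wins, which matches the behavior the PR settled on.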

@prusnak (Contributor) replied May 4, 2023:

Hm, shouldn't the safetensors be preferred if anything?

@ubik2 (Contributor, Author) replied:

That sounds good to me.
I initially wanted to minimize the behavior change, but I've updated the PR to try with the safetensors first.

@prusnak prusnak merged commit 95078cc into ggml-org:master May 8, 2023