llama : add fd-based model loading via llama_model_load_from_fd (REWORK) #20402

Closed

Siddhesh2377 wants to merge 4 commits into ggml-org:master from Siddhesh2377:fd-loading

Conversation

@Siddhesh2377 Contributor

Adds llama_model_load_from_fd() to load GGUF models from a POSIX file descriptor instead of a file path.

On Android, apps accessing user files through SAF only get a file descriptor, not a path. The alternative is copying the model into app storage or requesting MANAGE_EXTERNAL_STORAGE, which gets rejected by Google Play. This happened with my app (ToolNeuron).

This is a reworked version of a previous PR that was closed over code-quality concerns.

Not supported on Windows. The fd is dup()'d internally, so the caller retains ownership of the original descriptor.

Tested locally with CI and a real model (vocab_only + mmap).

@Siddhesh2377 Siddhesh2377 changed the title llama : add fd-based model loading via llama_model_load_from_fd llama : add fd-based model loading via llama_model_load_from_fd ( REWORK ) Mar 11, 2026
@github-actions github-actions bot added the testing (Everything test related) and ggml (changes relating to the ggml tensor library for machine learning) labels Mar 11, 2026
Comment thread ggml/src/gguf.cpp Outdated
Comment thread ggml/include/gguf.h Outdated

GGML_API struct gguf_context * gguf_init_empty(void);
GGML_API struct gguf_context * gguf_init_from_file(const char * fname, struct gguf_init_params params);
GGML_API struct gguf_context * gguf_init_from_fd(int fd, struct gguf_init_params params);
Contributor

For your purposes, would it work to expose the current gguf_init_from_file_impl as gguf_init_from_file_ptr and to use that as the basis for the implementation instead? That way we would be able to also use this code on Windows in conjunction with ggml_fopen.

Contributor Author

@Siddhesh2377 Mar 13, 2026

Done: replaced gguf_init_from_fd with gguf_init_from_file_ptr(FILE *) and moved the dup+fdopen logic up to llama-model-loader.

Contributor

The llama C API should also use a file pointer if at all possible, the conversion from file descriptor to file pointer should be in your user code.

Contributor Author

Done, switched the llama C API to a FILE pointer as well. The test shows the fd-to-FILE* conversion on the caller side.

Contributor

@JohannesGaessler left a comment

Maybe I should have made my intent more clear: The GGUF and llama APIs should be using a file pointer rather than a file descriptor because that has Windows compatibility. But then that is also what should be used internally because conversions between the two add unnecessary complexity. Your PR should not be using file descriptors anywhere, please consistently use file pointers.

Comment thread ggml/src/gguf.cpp Outdated
Comment on lines +856 to +862
struct gguf_context * gguf_init_from_file_ptr(FILE * file, struct gguf_init_params params) {
if (!file) {
return nullptr;
}
return gguf_init_from_file_impl(file, params);
}

Contributor

Just rename gguf_init_from_file_impl to gguf_init_from_file_ptr and add the check there. Keep the check in gguf_init_from_file since it is associated with a warning.

Comment thread include/llama.h Outdated
struct llama_model_params params);

// Load a model from an open FILE pointer
LLAMA_API struct llama_model * llama_model_load_from_file_ptr(FILE * file, struct llama_model_params params);
Contributor

Please keep the formatting consistent with the surrounding code.

Contributor Author

Done, and sorry for the misunderstanding :)

Comment thread src/llama-mmap.h
Comment on lines -23 to +24
int file_id() const; // fileno overload
int file_id() const;
Contributor

What did "fileno" refer to here and why did you remove this comment?

Comment thread src/llama.cpp
Comment on lines -893 to +897
GGML_ASSERT((metadata == nullptr) != path_model.empty() && "exactly one out of metadata and path_model needs to be defined");
if (metadata == nullptr && path_model.empty() && !file) {
LLAMA_LOG_ERROR("%s: no model source provided\n", __func__);
return nullptr;
}
Contributor

The logic should remain that exactly one out of the three things needs to be defined. Something like this should work:

GGML_ASSERT(int(metadata != nullptr) + int(!path_model.empty()) + int(file != nullptr) == 1 && "exactly one out of metadata, path_model, and file needs to be defined");

Contributor

I am currently adding file saving/loading to the recently added end-to-end tests in test-llama-archs.cpp via #20503. This will provide test coverage for your newly added code, so I don't think we need this additional test.

@Siddhesh2377 Siddhesh2377 closed this by deleting the head repository May 1, 2026