
llama: max ctx by default, fix fit magic number #18567

Closed
JohannesGaessler wants to merge 3 commits into ggml-org:master from JohannesGaessler:llama-fp-fix-ctx-magic

Conversation

@JohannesGaessler
Contributor

Fixes #18376.

The intent of llama_params_fit is to change only those parameters that the user has not set explicitly. However, if a user explicitly sets -c 0, that value is currently still changed by llama_params_fit. I would propose the following changes to fix this:

  • Change the data type of llama_context_params::n_ctx from uint32_t to int32_t with all values <= 0 being interpreted to mean that the full context should be used.
  • Change the default value set by llama_context_default_params from 512 to -1. With the llama C API, the model's full context would then be used by default, consistent with the common API.
  • In llama_params_fit, check the context size against the new default value instead of against 0.
  • For the CLI, add the strings "auto" and "full" for --ctx-size; both use the full context by default but differ in whether llama_params_fit may change them.
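The proposed semantics can be sketched as follows (a minimal illustration, not the actual llama.cpp code; the helper names resolve_n_ctx and model_n_ctx_train are hypothetical stand-ins):

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical stand-in for the model's training context size
// (llama_model_n_ctx_train in the real API).
static int32_t model_n_ctx_train() { return 4096; }

// Proposed semantics: any n_ctx <= 0 means "use the model's full context".
// With a default of -1, llama_params_fit can distinguish "user left the
// default" (-1) from "user explicitly requested the full context" (0),
// and only adjust the former.
static int32_t resolve_n_ctx(int32_t n_ctx) {
    if (n_ctx <= 0) {
        return model_n_ctx_train();
    }
    return n_ctx;
}
```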

An alternative approach would be to keep the changes entirely in the common API.

@CISC
Member

CISC commented Jan 3, 2026

Hmmm, what's with the sign disparity between common and llama params?

@JohannesGaessler
Contributor Author

I think the biggest reason is simply that no one invested the effort to make things consistent. Going forward, I would suggest that for non-negative quantities like the context size we store the values internally as unsigned, but use signed values in llama_model_params and llama_context_params in order to encode special values.
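The suggested convention could look like this (a sketch under assumed names; params_public and state_internal are illustrative, not real llama.cpp types):

```cpp
#include <cassert>
#include <cstdint>

// Public-facing params use a signed field so that special values
// (here: anything <= 0 meaning "full context") can be encoded.
struct params_public {
    int32_t n_ctx;
};

// Internal state stores only concrete, non-negative sizes as unsigned.
struct state_internal {
    uint32_t n_ctx;
};

// Translate the signed public value into a concrete unsigned size,
// resolving the sentinel against the model's training context size.
static state_internal apply_params(const params_public & p, uint32_t n_ctx_train) {
    state_internal s;
    s.n_ctx = p.n_ctx <= 0 ? n_ctx_train : static_cast<uint32_t>(p.n_ctx);
    return s;
}
```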

Member

@ggerganov left a comment


An alternative approach would be to keep the changes entirely in the common API.

This seems like the better approach.

Comment thread: include/llama.h

     // https://github.com/ggml-org/llama.cpp/pull/7544
     struct llama_context_params {
-        uint32_t n_ctx; // text context, 0 = from model
+        int32_t  n_ctx; // context size in tokens, use llama_model_n_ctx_train for values <= 0
Member


Hm, I don't see why this change is needed. It looks like it allows negative values with the same functionality as n_ctx == 0, so user code can simply use params.n_ctx = max(0, n_ctx); to achieve the same.
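The suggested workaround can be written as a standalone sketch (the helper name clamp_n_ctx is illustrative):

```cpp
#include <algorithm>
#include <cstdint>

// With the existing uint32_t field, user code can map any negative
// "use full context" sentinel onto 0 before passing it to the API,
// since 0 already means "from model".
static uint32_t clamp_n_ctx(int32_t requested) {
    return static_cast<uint32_t>(std::max<int32_t>(0, requested));
}
```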

@JohannesGaessler
Contributor Author

Superseded by #19070.


Labels

examples · testing (Everything test related)


Development

Successfully merging this pull request may close these issues.

Misc. bug: Maximum Context Size
