Skip to content

server: coherent log output for KV cache full#6637

Merged
ggerganov merged 1 commit intomasterfrom
hp/server/coherent-logs
Apr 12, 2024
Merged

server: coherent log output for KV cache full#6637
ggerganov merged 1 commit intomasterfrom
hp/server/coherent-logs

Conversation

@phymbert
Copy link
Copy Markdown
Collaborator

Motivation

Some logs are using llama log method instead of the server one.

Changes

  • Use server logs dedicated method when KV Cache is full.

Still some to update during slots context shifting, but it can be done later on.

Concerns

One would want to log also server logs in llama.log but it can be done later.

References

@ggerganov ggerganov merged commit 24ee66e into master Apr 12, 2024
@phymbert phymbert deleted the hp/server/coherent-logs branch April 12, 2024 11:55
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants