Fix: attempt to reduce the impact of a worst-case scenario on defragmentation by Xarbirus · Pull Request #6037 · ggml-org/llama.cpp

Xarbirus · 2024-03-13T10:33:09Z

Perhaps this update for the llama_kv_cache_defrag_internal function will help improve the handling of very large holes in the cache.

…rg#6037) * attempt to reduce the impact of a worst-case scenario * fragmentation calculation fix * Update llama.cpp --------- Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

Xarbirus mentioned this pull request Mar 13, 2024

Checking llama's defrag_graph size #6019

Closed

Xarbirus added 2 commits March 13, 2024 21:55

attempt to reduce the impact of a worst-case scenario

97ad402

fragmentation calculation fix

38328bb

Xarbirus force-pushed the defrag-update branch from a71b695 to 38328bb Compare March 13, 2024 20:59

Xarbirus marked this pull request as ready for review March 13, 2024 21:01

ggerganov approved these changes Mar 14, 2024

View reviewed changes

Comment thread llama.cpp Outdated

Update llama.cpp

b88cd9f

ggerganov merged commit 2c4fb69 into ggml-org:master Mar 14, 2024

Xarbirus deleted the defrag-update branch April 17, 2024 10:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: attempt to reduce the impact of a worst-case scenario on defragmentation#6037

Fix: attempt to reduce the impact of a worst-case scenario on defragmentation#6037
ggerganov merged 3 commits intoggml-org:masterfrom
Xarbirus:defrag-update

Xarbirus commented Mar 13, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Xarbirus commented Mar 13, 2024

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants