
llama : avoid double token-to-piece cache #7654

Merged
ggerganov merged 1 commit into master from gg/cache-no-special on Jun 3, 2024
Conversation

@ggerganov (Member)
@mofosyne added the "Review Complexity : Medium" label (generally requires more time to grok, but manageable by beginner to medium expertise) on May 31, 2024
@ggerganov ggerganov merged commit 549279d into master Jun 3, 2024
@ggerganov ggerganov deleted the gg/cache-no-special branch June 3, 2024 05:34
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026

2 participants