add missing kv clear in llama_beam_search by dwrensha · Pull Request #6664 · ggml-org/llama.cpp

dwrensha · 2024-04-13T18:46:05Z

Adds a call to llama_kv_cache_seq_rm() in llama_beam_search_data::fill_next_beams_by_top_probabilities().

This seems to be necessary for the subsequent llama_decode() calls -- which can act on the same sequence at the same position with different test tokens -- to return reasonable results.

Before this change:

$ bin/beam-search ~/models/mistral-7b-v0.1.Q8_0.gguf 3 "fibonacci: 1, 1, 2, 3, 5, 8, 13,"
...
 21, 341, 610, 987, 1597, 233

After this change:

$ bin/beam-search ~/models/mistral-7b-v0.1.Q8_0.gguf 3 "fibonacci: 1, 1, 2, 3, 5, 8, 13,"
...
 21, 34, 55, 89, 144, 233, 377, 610, 987, 1597, 2584, 4181, 6765, 10946, 17711, 28657, 46368, 75025, 121393, 196418, 317811, 514229, 832040, 1346269, 2178309, 3524578, 5702887, 9227465, 14930352, 24157817, 39088169, 63245986, 102334155, 165580141, 26791429

dwrensha · 2024-04-13T18:46:43Z

cc @mattpulver, who added beam search in #2267.

compilade

Nice catch!

Throwing an idea here for later: I think the beam search should eventually be adapted to use one seq_id per beam to facilitate the logic for sharing KV cells between beams, and to allow parallel beam search.

dwrensha · 2024-04-14T18:56:07Z

I think the beam search should eventually be adapted to use one seq_id per beam to facilitate the logic for sharing KV cells between beams, and to allow parallel beam search.

I agree!

add missing kv clear in llama_beam_search

e969ada

compilade approved these changes Apr 14, 2024

View reviewed changes

compilade merged commit 1958f7e into ggml-org:master Apr 14, 2024

dwrensha deleted the fix-beam-search branch April 14, 2024 19:26

cebtenzzre mentioned this pull request Apr 15, 2024

Add beam search abetlen/llama-cpp-python#631

Closed

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

llama : add missing kv clear in llama_beam_search (ggml-org#6664)

515433e

phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026

llama : add missing kv clear in llama_beam_search (ggml-org#6664)

3aca280

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add missing kv clear in llama_beam_search#6664

add missing kv clear in llama_beam_search#6664
compilade merged 1 commit intoggml-org:masterfrom
dwrensha:fix-beam-search

dwrensha commented Apr 13, 2024

Uh oh!

dwrensha commented Apr 13, 2024 •

edited

Loading

Uh oh!

compilade left a comment

Uh oh!

dwrensha commented Apr 14, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

dwrensha commented Apr 13, 2024

Uh oh!

dwrensha commented Apr 13, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

compilade left a comment

Choose a reason for hiding this comment

Uh oh!

dwrensha commented Apr 14, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

dwrensha commented Apr 13, 2024 •

edited

Loading