llama : set attrs of mislabelled EOT/EOM tokens by bakkot · Pull Request #9348 · ggml-org/llama.cpp

bakkot · 2024-09-07T12:52:43Z

I have read the contributing guidelines
Self-reported review complexity:
- Low
- Medium
- High

Some models (such as the official phi-3-mini-4k-instruct-gguf) have an <|end|> or similar token which is detected as being EOT or EOM but which is not marked as control.

This causes issues where the token will be included in the output when it shouldn't. A complete reproduction is to install the phi-3-mini-4k-instruct-gguf linked above and then run

./llama-cli -m ./models/Phi-3-mini-4k-instruct-q4.gguf --prompt "What kind of thing is a llama? Response: " --no-display-prompt --temp 0 --grammar 'root ::= "animal" | "object"'

which will print animal<|end|>.

In these cases, manually set the token's attr to LLAMA_TOKEN_ATTR_CONTROL so that we know not to print it. That fixes the above repro to correctly print just animal.

compilade

Note that reconverting the Phi-3 models since #8228 should mark <|end|> as CONTROL.

But we have no control over the "official" models.

llama : set attrs of mislabelled EOT/EOM tokens

91695ad

compilade approved these changes Sep 8, 2024

View reviewed changes

ggerganov merged commit fbb7fcf into ggml-org:master Sep 8, 2024

bakkot deleted the handle-mislabeled-control-tokens branch September 8, 2024 11:59

dsx1986 pushed a commit to dsx1986/llama.cpp that referenced this pull request Oct 29, 2024

llama : set attrs of mislabelled EOT/EOM tokens (ggml-org#9348)

66898d7

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 15, 2024

llama : set attrs of mislabelled EOT/EOM tokens (ggml-org#9348)

1354b5f

arthw pushed a commit to arthw/llama.cpp that referenced this pull request Nov 18, 2024

llama : set attrs of mislabelled EOT/EOM tokens (ggml-org#9348)

040d730

Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026

llama : set attrs of mislabelled EOT/EOM tokens (ggml-org#9348)

11e5746

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llama : set attrs of mislabelled EOT/EOM tokens#9348

llama : set attrs of mislabelled EOT/EOM tokens#9348
ggerganov merged 1 commit intoggml-org:masterfrom
bakkot:handle-mislabeled-control-tokens

bakkot commented Sep 7, 2024 •

edited

Loading

Uh oh!

compilade left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

bakkot commented Sep 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

compilade left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

bakkot commented Sep 7, 2024 •

edited

Loading