Skip to content

bench: server add stop word for PHI-2#6916

Merged
phymbert merged 1 commit intomasterfrom
hp/server/bench/fix-stop-word
Apr 26, 2024
Merged

bench: server add stop word for PHI-2#6916
phymbert merged 1 commit intomasterfrom
hp/server/bench/fix-stop-word

Conversation

@phymbert
Copy link
Copy Markdown
Collaborator

Context

Since we properly support BOS/EOT with llama_token_is_eog and removed hardcoded stop words in utils.hpp, the phi-2 base model never ends generation. So one can think the benchmark figures decrease.

This temporary fix before switching to an instruct model in the server benchmark.

References

@phymbert phymbert requested a review from ggerganov April 26, 2024 07:06
@phymbert phymbert added performance Speed related topics build Compilation issues server/webui labels Apr 26, 2024
@phymbert phymbert merged commit 5790c8d into master Apr 26, 2024
@phymbert phymbert deleted the hp/server/bench/fix-stop-word branch April 26, 2024 07:26
@ggerganov ggerganov mentioned this pull request Apr 26, 2024
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

build Compilation issues performance Speed related topics server/webui

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants