
llama : fix llama_chat_format_single for mistral #8657

Merged
ngxson merged 3 commits into ggml-org:master from ngxson:xsn/fix_mistral_chat_format
Jul 24, 2024

Conversation

Contributor
@ngxson ngxson commented Jul 23, 2024

Resolve #8655

Fix `llama_chat_format_single` incorrectly formatting the system message.

Also added some logs and test cases for this.

The output with this PR:

```
[1721763727] formatted: [INST] You are an assistant

[1721763727] tokenize the prompt
[1721763727] prompt: "[INST] You are an assistant
"
[1721763727] tokens: [ '<s>':1, '[INST]':3, ' You':3213, ' are':1584, ' an':1420, ' assistant':27089, '':1010 ]

...

[1721763730] buffer: 'hello
'
[1721763730] formatted: 
hello
 [/INST]
[1721763730] input tokens: [ '':1010, 'hello':29706, '':1010, ' ':1032, '[/INST]':4 ]
```
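For context, `llama_chat_format_single` computes the text to feed for one new message by formatting the conversation twice, without and with that message, and returning only the added suffix. The sketch below illustrates that diff idea in a self-contained way; it is hypothetical code, not llama.cpp's actual implementation, and the toy `apply_template` only mimics Mistral's behavior of folding the system message into the first `[INST]` block:

```cpp
#include <string>
#include <vector>

struct chat_msg {
    std::string role;
    std::string content;
};

// Toy Mistral-style template (illustration only): the system message is
// folded into the next [INST] block instead of being emitted on its own.
static std::string apply_template(const std::vector<chat_msg> & msgs) {
    std::string out;
    std::string system;
    for (const auto & m : msgs) {
        if (m.role == "system") {
            system = m.content + "\n";   // held back until the next user turn
        } else if (m.role == "user") {
            out += "[INST] " + system + m.content + " [/INST]";
            system.clear();
        } else { // assistant
            out += m.content + "</s>";
        }
    }
    return out;
}

// Return only the text the new message appends: format without and with
// the new message, then take the suffix. If the template changes earlier
// output depending on pending messages (as a mishandled system message
// can), the prefix assumption breaks -- the class of bug fixed here.
static std::string format_single(std::vector<chat_msg> past, const chat_msg & new_msg) {
    const std::string without = apply_template(past);
    past.push_back(new_msg);
    const std::string with_new = apply_template(past);
    return with_new.substr(without.size());
}
```

With this toy template, a system-only history formats to an empty string, so the first user turn's delta correctly carries the folded-in system text, matching the `[INST] You are an assistant` prompt in the log above.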

@ngxson ngxson requested a review from ggerganov July 23, 2024 19:45
@github-actions github-actions bot added the testing (Everything test related) and examples labels Jul 23, 2024
Contributor

@HanClinto HanClinto left a comment


Looks good to me!

The added test is very good. Without the fix applied, I can confirm the problem is exhibited and the test fails; after applying the fix, the test passes and the behavior matches what's expected.

Can't ask for more than that! Nice fix! :)

@ngxson ngxson changed the title fix llama_chat_format_single for mistral llama : fix llama_chat_format_single for mistral Jul 23, 2024
Comment thread tests/test-chat-template.cpp Outdated


```cpp
// test llama_chat_format_single for user message
std::cout << "\n\n=== llama_chat_format_single (user message) ===\n\n";
```
Member


nit: prefer `printf` over `std::cout`, mainly for consistency with the rest of the code

Contributor Author


Yup, that makes sense, I changed it to `printf`: 20f56a0

Will merge after CI pass
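The change discussed in this thread amounts to something like the following (a hypothetical standalone rendition for illustration, not the exact contents of commit 20f56a0):

```cpp
#include <cstdio>
#include <string>

// Hypothetical helper illustrating the switch from std::cout to the
// C-style I/O used throughout llama.cpp. snprintf builds the same
// banner string that the quoted std::cout line printed.
static std::string section_header(const std::string & name) {
    char buf[256];
    snprintf(buf, sizeof(buf), "\n\n=== %s ===\n\n", name.c_str());
    return buf;
}

// Usage, replacing the std::cout line quoted above:
//   printf("%s", section_header("llama_chat_format_single (user message)").c_str());
```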

@ngxson ngxson added the merge ready label Jul 24, 2024
@ngxson ngxson merged commit 96952e7 into ggml-org:master Jul 24, 2024
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Jul 27, 2024
* fix `llama_chat_format_single` for mistral

* fix typo

* use printf
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
* fix `llama_chat_format_single` for mistral

* fix typo

* use printf

Labels

examples
merge ready: A maintainer can use this label to indicate that they consider the changes final and ready to merge.
testing: Everything test related

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Bug: Mistral-Nemo-Instruct Chat template seems to be applied completely wrong

3 participants