embedding: adjust `n_ubatch` value, print error on insufficient `n_batch` value by mscheong01 · Pull Request #6296 · ggml-org/llama.cpp

mscheong01 · 2024-03-25T11:24:45Z

Updates to the embedding example, based on the discussion in #6193:

- assign the `n_batch` value to `n_ubatch`
- print an error when the batch size is insufficient
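
For readers without the full diff at hand, the two changes can be sketched roughly as follows. This is a minimal, self-contained illustration and not the code in `examples/embedding/embedding.cpp`: the `batch_params` struct, its default values, and the helper names `sync_ubatch` and `check_input_fits` are hypothetical stand-ins. Only the error message text is taken from the diff in this PR.

```cpp
#include <cstddef>
#include <cstdint>
#include <cstdio>
#include <vector>

// Hypothetical stand-in for the example's parameter struct; only the two
// fields discussed in this PR are modelled, with illustrative defaults.
struct batch_params {
    int32_t n_batch  = 2048;
    int32_t n_ubatch = 512;
};

// Change 1: make n_ubatch follow n_batch.
static void sync_ubatch(batch_params & params) {
    params.n_ubatch = params.n_batch;
}

// Change 2: report an error instead of silently truncating the input.
// Returns true when the tokenized line fits into a single batch.
static bool check_input_fits(const std::vector<int32_t> & inp, uint64_t n_batch) {
    if (inp.size() > n_batch) {
        fprintf(stderr, "%s: error: number of tokens in input line (%lld) exceeds batch size (%lld), increase batch size and re-run\n",
                __func__, (long long int) inp.size(), (long long int) n_batch);
        return false;
    }
    return true;
}
```

A caller would invoke `sync_ubatch` once after parsing parameters and stop with a non-zero exit code as soon as `check_input_fits` returns false for any input line.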

ngxson · 2024-03-25T21:36:35Z

        if (inp.size() > n_batch) {
-            inp.resize(n_batch);
+            fprintf(stderr, "%s: error: number of tokens in input line (%lld) exceeds batch size (%lld), increase batch size and re-run\n",
+                    __func__, (long long int) inp.size(), (long long int) n_batch);


Here you can use `%ld` instead of `%lld`; then there is no need to cast the type.

Suggested change

-                    __func__, (long long int) inp.size(), (long long int) n_batch);
+                    __func__, inp.size(), n_batch);

Applied, then rolled back due to a build failure.
IIRC, this is why I used `%lld` with casting in #6193, although I tried your suggestion just in case 😉.

We normally use `PRIu64` / `PRId64` to print 64-bit integers. Alternatively, in this case just using `%d` and casting to `(int)` is fine.
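
As a rough illustration of the `PRIu64` approach mentioned in this comment, the message could be formatted as in the sketch below. The helper name `format_overflow_error` is hypothetical and the message is shortened; the PR as merged kept `%lld` with casts. The sketch assumes `n_batch` is held as a `uint64_t`, and it casts the `size_t` token count to `uint64_t` so that both arguments match the specifier on every platform.

```cpp
#include <cinttypes>
#include <cstddef>
#include <cstdint>
#include <cstdio>
#include <string>

// PRIu64 expands to the correct printf conversion specifier for uint64_t on
// the target platform, so no platform-specific choice between %lu and %llu
// is needed.
static std::string format_overflow_error(size_t n_tokens, uint64_t n_batch) {
    char buf[256];
    snprintf(buf, sizeof(buf),
             "error: number of tokens in input line (%" PRIu64 ") exceeds batch size (%" PRIu64 ")",
             (uint64_t) n_tokens, n_batch);
    return std::string(buf);
}
```

Passing a `size_t` or `uint64_t` directly to `%ld` is only correct where `long` happens to have the same width, which is one plausible reason a build with strict format warnings would reject it. For a bare `size_t`, `%zu` is another portable option.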

* embedding: assign `n_ubatch` value, print error on `n_batch` overflow
* Update examples/embedding/embedding.cpp
  Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>
* use %ld instead of %lld
* Revert "use %ld instead of %lld"
  This reverts commit ea753ed.

---------

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

embedding: assign n_ubatch value, print error on n_batch overflow

6e27406

ngxson requested changes Mar 25, 2024

mscheong01 and others added 4 commits March 26, 2024 09:25

Update examples/embedding/embedding.cpp

d054109

Co-authored-by: Xuan Son Nguyen <thichthat@gmail.com>

use %ld instead of %lld

ea753ed

Merge branch 'embedding-assign-n_ubatch-value,-print-error-on-`n_ba…

544b447

…tch`-overflow' of github.com:mscheong01/llama.cpp into embedding-assign-`n_ubatch`-value,-print-error-on-`n_batch`-overflow

Revert "use %ld instead of %lld"

2258098

This reverts commit ea753ed.

ggerganov approved these changes Mar 26, 2024

ggerganov merged commit deb7240 into ggml-org:master Mar 26, 2024

cebtenzzre mentioned this pull request May 28, 2024

llamamodel: fix embedding crash for >512 tokens after #2310 nomic-ai/gpt4all#2383

Merged

thxCode mentioned this pull request Aug 6, 2024

fix: crash on bge-m3 embedding model #8883

Open

ggerganov merged 5 commits into ggml-org:master from mscheong01:embedding-assign-`n_ubatch`-value,-print-error-on-`n_batch`-overflow
