Skip to content

examples : evaluate tokens in batches after swapping context#1014

Merged
ggerganov merged 2 commits intoggml-org:masterfrom
grencez:batching
Apr 21, 2023
Merged

examples : evaluate tokens in batches after swapping context#1014
ggerganov merged 2 commits intoggml-org:masterfrom
grencez:batching

Conversation

@grencez
Copy link
Copy Markdown
Contributor

@grencez grencez commented Apr 16, 2023

This new loop around llama_eval is a bit redundant with the batching done in the main loop, but without a refactor it's all still necessary to keep print statements happening at the right times.

@grencez grencez force-pushed the batching branch 5 times, most recently from 26748b2 to 3bc0a89 Compare April 16, 2023 10:22
@grencez grencez changed the title Evaluate tokens in batches after swapping context examples: Evaluate tokens in batches after swapping context Apr 16, 2023
@grencez grencez changed the title examples: Evaluate tokens in batches after swapping context examples : evaluate tokens in batches after swapping context Apr 16, 2023
@grencez grencez marked this pull request as ready for review April 16, 2023 10:30
@grencez
Copy link
Copy Markdown
Contributor Author

grencez commented Apr 17, 2023

Tests passed yesterday. I just synced recent changes and added a comment.

Comment thread examples/main/main.cpp Outdated
@ggerganov ggerganov merged commit 9411288 into ggml-org:master Apr 21, 2023
@grencez grencez deleted the batching branch April 21, 2023 21:09
Seunghhon pushed a commit to Seunghhon/llama.cpp that referenced this pull request Apr 26, 2026
* examples : evaluate tokens in batches after swapping context

* Update examples/main/main.cpp

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
* examples : evaluate tokens in batches after swapping context

* Update examples/main/main.cpp

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
phuongncn pushed a commit to phuongncn/llama.cpp-gx10-dgx-sparks-deepseekv4 that referenced this pull request Apr 28, 2026
Co-authored-by: Iwan Kawrakow <iwan.kawrakow@gmail.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants