-
Notifications
You must be signed in to change notification settings - Fork 33.1k
[refactor] Serving into proper modules #44796
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
76 commits
Select commit
Hold shift + click to select a range
d8e7c45
new serve file
SunMarc f238867
app
SunMarc be0291d
model_manager done
SunMarc e84d82e
update serve
SunMarc fb77305
style
SunMarc d869d62
poc done
SunMarc 5aadd1a
Merge remote-tracking branch 'origin/main' into refactor-serving
SunMarc bd734e8
renaming
SunMarc 69d3264
fix
SunMarc f5afd6c
new tests
SunMarc fedad8e
update metrics and processor
SunMarc 9b904b1
hardcode n_batch for now
SunMarc 0084b91
add response api + compile
SunMarc 1d5d1cb
more tests
SunMarc 3d64a8c
add it for now but we will move it
SunMarc 74b3593
Merge remote-tracking branch 'origin/main' into refactor-serving
SunMarc 552603c
remove cache impl
SunMarc 3643ece
add back load_model
SunMarc 12c0f55
fix naming
SunMarc d4ffdf4
add transcription
SunMarc 68cd5bc
tool calls better !
SunMarc 6da3f3c
vlm support for both response and chat endpoints
SunMarc a92ebe2
update bench
SunMarc 76a5c83
fix vl test
SunMarc 31e59c3
first iteration of cb
SunMarc 962d039
cb tests
SunMarc 13945c1
typing + review
SunMarc 4abb194
update test
SunMarc 1658981
better benchmark
SunMarc 720ecdb
better stream
SunMarc 4424635
update bench
SunMarc 7d0cd77
fix
SunMarc 533233c
serve refactored
SunMarc 880e6e0
merge
SunMarc 4aa7fec
update
SunMarc 3ab4e09
fix
SunMarc 06bacbb
style
SunMarc ef10618
simpler
SunMarc 09d5fe1
style
SunMarc 96b6b8b
update warmup
SunMarc 07ecd2a
remove llamacpp integration for now
SunMarc fad7c25
styke
SunMarc feed4cb
styke
SunMarc abd4087
style again
SunMarc 120e37b
Merge branch 'main' into output-callback-cb
SunMarc d550b9b
remove annoattion
SunMarc ca06e2b
Merge branch 'main' into refactor-serving
SunMarc ac0d6a1
review !
SunMarc 66314b5
Merge remote-tracking branch 'origin/main' into output-callback-cb
SunMarc 9d52002
style
SunMarc c48aec3
much cleaner
SunMarc b13dacc
renamed
SunMarc 7855606
remove bench for now
SunMarc ef1c710
batch output
SunMarc caaab6e
style
SunMarc 4c1cd01
type
SunMarc 702ff74
better tests
SunMarc 80b5c78
update test
SunMarc a8461fc
queue draining
SunMarc 480828d
Merge branch 'main' into output-callback-cb
SunMarc cb83702
Merge remote-tracking branch 'origin/main' into refactor-serving
SunMarc 9db52a0
add seed
SunMarc 9485f68
Merge branch 'main' into refactor-serving
SunMarc 160b2f6
Merge remote-tracking branch 'origin/main' into refactor-serving
SunMarc 40417ee
some logs
SunMarc 3bd6a09
Merge remote-tracking branch 'origin/main' into refactor-serving
SunMarc ced96c2
readd nathan feature + some minor fixes
SunMarc ff02cd7
fix
SunMarc 307498e
guard transcription
SunMarc ffe4c64
better now
SunMarc 06a7881
fix
SunMarc 052cbc7
adding lock to see if this helps
SunMarc 6799727
remove locks
SunMarc 3a07c86
lock again
SunMarc 7a7abf2
update bench and remove lock for now
SunMarc bbd5cb0
Merge branch 'main' into refactor-serving
SunMarc File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.