[Pipeline inference] Modify to tieweight #4599
Merged
FoolPlayer merged 21 commits into hpcaitech:feature/pipeline-infer from FoolPlayer:ppinfer-tieweight on Sep 7, 2023
Commits
59c13a3 FoolPlayer: add pp stage manager as circle stage
56cabee FoolPlayer: fix a bug when create process group
110cf72 FoolPlayer: add ppinfer basic framework
99b740c FoolPlayer: add micro batch manager and support kvcache-pp gpt2 fwd
b023f59 FoolPlayer: add generate schedule
63b71d1 FoolPlayer: use mb size to control mb number
7e0c63f FoolPlayer: support generate with kv cache
367bbb6 FoolPlayer: add output, remove unused code
e2911e4 FoolPlayer: add test
69dc70e FoolPlayer: Merge branch 'feature/shardformer' of https://github.com/hpcaitech/Co…
f3b6122 FoolPlayer: reuse shardformer to build model
33a9cf8 FoolPlayer: refactor some code and use the same attribute name of hf
ba01e8a FoolPlayer: fix review and add test for generation
0e3a5c1 FoolPlayer: remove unused file
971119e FoolPlayer: modify the way of saving newtokens
c1b284e FoolPlayer: modify to tieweight
d878cba FoolPlayer: Merge branch 'feature/pipeline-infer' of https://github.com/hpcaitech…
7d530df FoolPlayer: modify test
c6373e5 FoolPlayer: remove unused file
02b8317 FoolPlayer: solve review
e8474cc FoolPlayer: add docstring