[gemini] gemini support tensor parallelism. #4942
Merged
47 commits
dc0dc0b [colossalai]fix typo (flybird11111)
dd59ca2 [inference] Add smmoothquant for llama (#4904) (Xu-Kai)
52707c6 Update flash_attention_patch.py (Orion-Zheng)
61ec9f7 [kernel] support pure fp16 for cpu adam and update gemini optim tests… (ver217)
561553b [format] applied code formatting on changed files in pull request 490… (github-actions[bot])
8d42002 [gemini] support gradient accumulation (#4869)
da55732 [hotfix] fix torch 2.0 compatibility (#4936) (ver217)
775ea1b [test] add no master test for low level zero plugin (#4934) (KKZ20)
0074178 [format] applied code formatting on changed files in pull request 482… (github-actions[bot])
907aa98 [nfc] fix some typo with colossalai/ docs/ etc. (#4920) (digger-yu)
31fddbc [Refactor] Integrated some lightllm kernels into token-attention (#4… (tiandiao123)
8633a87 [test] merge old components to test to model zoo (#4945) (ver217)
9d543af [inference] add reference and fix some bugs (#4937) (Xu-Kai)
fe79560 [Inference]ADD Bench Chatglm2 script (#4963) (CjhHa1)
a610046 [Pipeline inference] Combine kvcache with pipeline inference (#4938) (FoolPlayer)
3b8137d updated c++17 compiler flags (#4983) (kurisusnowdeng)
9fce43b [Inference] Dynamic Batching Inference, online and offline (#4953) (CjhHa1)
62eb99f [Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding … (tiandiao123)
fa1cbd3 fix ColossalEval (#4992) (chengeharrison)
3209431 [doc]Update doc for colossal-inference (#4989) (tiandiao123)
f0482f4 [hotfix] Fix the bug where process groups were not being properly rel… (littsk)
cd8ad65 [hotfix] fix the bug of repeatedly storing param group (#4951)
5266946 [doc] add supported feature diagram for hybrid parallel plugin (#4996) (ppt0011)
ab8468c [Pipeline Inference] Merge pp with tp (#4993) (FoolPlayer)
f9c1920 [release] update version (#4995) (ver217)
2043b9d [gemini] gemini support tp (flybird11111)
da1915d fix (flybird11111)
9fd9e69 update checkpointIO (flybird11111)
a89f2fd support fused layernorm (flybird11111)
2406cb0 update fusedlayernorm (flybird11111)
a0509a6 add sequence parallel to gemini (flybird11111)
12cd780 fix (flybird11111)
0110902 fix comments (flybird11111)
86a5eca fix (flybird11111)
6f13876 fix t5 (flybird11111)
5f16e4f clear cache (flybird11111)
adead50 fix (flybird11111)
ed825dc activate ci (flybird11111)
37494c3 activate ci (flybird11111)
73da4ca fix (flybird11111)
cf2bc63 fix (flybird11111)
6c85a9e fix (flybird11111)
8dd4b41 fix (flybird11111)
3d8319e revert (flybird11111)
66ffed5 modify tp gather method (flybird11111)
c40c459 fix test (flybird11111)
bc575a2 Merge branch 'main' into gemini-tp (flybird11111)