[MoE/ZeRO] Moe refactor with zero refactor #5821
Merged
54 commits
f1d4167
[moe] removed openmoe-coupled code and rectified mixtral code (#5471)
FrankLeeeee df6826d
[Feature] MoE refactor; Integration with Mixtral (#5682)
Edenzzzz d49fd63
add mixtral auto policy & move pipeline forward code to modeling folder
Hz188 d2e07fc
[moe refactor] modify kernel test without Route Class
Hz188 7556b8f
[moe refactor] add moe tensor test path environment variable to githu…
Hz188 16329d5
fix typos
Hz188 b934437
fix moe test bug due to the code rebase
Hz188 a792e83
[moe refactor] fix moe zero test, and little bug in low level zero
Hz188 d203ba8
fix typo
Hz188 55c7416
add moe tensor path to github workflow
Hz188 8915e9d
remove some useless code
Hz188 7963fb0
fix typo & unify global variable XX_AXIS logic without using -1
Hz188 32ced74
fix typo & prettify the code
Hz188 3100c1b
remove print code & support zero 2 test
Hz188 928ee39
remove useless code
Hz188 6dc0cfc
rename function
Hz188 4417840
fix typo
Hz188 eb35655
fix typo
Hz188 d1d446b
Further improve the test code
Hz188 09a5188
remove print code
Hz188 4c6ea42
[moe refactor] change test model from fake moe model to mixtral moe l…
Hz188 80b6586
[moe refactor] skip some unit test which will be refactored later
Hz188 7d06220
[moe refactor] fix unit import error
Hz188 fb41f42
[moe refactor] fix circular import issues
Hz188 e99b69c
[moe refactor] remove debug code
Hz188 af9ade6
[moe refactor] update github workflow
Hz188 49d74f3
Merge pull request #5775 from Hz188/feature/moe
botbw d71ab10
[moe/zero] refactor low level optimizer (#5767)
botbw 88f318a
[Feature] MoE refactor with newest version of ZeRO (#5801)
Hz188 b2ac7e5
[zero] remove redundant members in BucketStore (#5802)
botbw 346a0df
[zero] align api with previous version
botbw a3a7d7d
Merge pull request #5811 from botbw/moe
botbw ba0115a
[Moe/Zero] Update MoeHybridParallelPlugin with refactored ZeRO and Fi…
Hz188 a10802e
[hotfix]Solve the compatibility issue of zero refactor (#5823)
Hz188 4cd4a1f
[zero] fix missing hook removal (#5824)
botbw 729388e
[MoE] Resolve .github conflict (#5829)
Hz188 d9ea6d4
[zero] fix hook bug
Hz188 b04e99c
Merge branch 'main' into feature/moe
Hz188 62cd25d
[zero] add low level optimizer back (#5839)
botbw 204d25c
[zero] comments and naming (#5840)
botbw efdfa06
[zero] modify api (#5843)
botbw 44aeccc
[test] fix (#5857)
botbw 9398484
[CI] skip openmoe CI check
Hz188 5e551f8
[CI] fix pre-commit
Hz188 2ff332c
[zero] remove redundant member init (#5862)
botbw 75be843
[misc] remove useless code, modify the pg mesh implementation
Hz188 1855442
Merge branch 'hpcaitech:feature/moe' into feature/moe
Hz188 3a25166
[misc] remove useless code, modify the pg mesh implementation
Hz188 502e514
[misc] use tempfile
Hz188 494b8a2
resolve conflict with main branch
Hz188 961e96f
resolve conflict with main branch
Hz188 95c4c0b
[misc] use tempfile in test_moe_checkpoint.py
Hz188 9e966b9
[misc] remove useless code, add assertion about sequence parallel, mo…
Hz188 165e894
[misc] remove useless code
Hz188
92 changes: 0 additions & 92 deletions
applications/ColossalMoE/colossal_moe/models/mixtral_layer.py
This file was deleted.