-
Notifications
You must be signed in to change notification settings - Fork 33.1k
add rotary kernel support to Qwen3 model #41147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
43 commits
Select commit
Hold shift + click to select a range
69f2ca8
add rotary kernel support to Qwen3 model
kaixuanliu d2bf5c5
delete unnecessary import
kaixuanliu b0cbab5
adjust code
kaixuanliu 8dede65
adjust code
kaixuanliu 5c02189
Merge branch 'rotary-kernel' of https://github.com/kaixuanliu/transfo…
kaixuanliu 137069b
put get rotary kernel to hub_kernels.py
kaixuanliu 8ac3e1e
fix wrong import
kaixuanliu 29f83f2
refine code and adjust related modular code
kaixuanliu 7729b7f
Merge branch 'main' into rotary-kernel
kaixuanliu 94e4f60
fix modular mismatch bug
kaixuanliu b96a7c9
Merge branch 'rotary-kernel' of https://github.com/kaixuanliu/transfo…
kaixuanliu aebac76
Merge branch 'main' into rotary-kernel
kaixuanliu af67a74
update code, use lazy load kernels
kaixuanliu 28c69d3
fix check modular conversion issue
kaixuanliu cff1580
Merge branch 'main' into rotary-kernel
kaixuanliu 7ac16a1
fix CI bug for qwen3-next
kaixuanliu adce121
fix CI issue
kaixuanliu b4757f4
delete unused code
kaixuanliu 5ff6b16
Merge branch 'main' into rotary-kernel
kaixuanliu 3bbfd64
rename to `apply_rotary_transformers`
kaixuanliu ec4ca1d
Merge branch 'main' into rotary-kernel
kaixuanliu caa549a
adjust import `lazy_load_kernel` location
kaixuanliu 08a9959
Update modular-generated modeling files with lazy_load_kernel import …
kaixuanliu 7915fc1
fix conflicts
kaixuanliu 6f2d958
add more check
kaixuanliu e67ca60
Merge branch 'main' into rotary-kernel
kaixuanliu 36c2205
Merge branch 'main' into rotary-kernel
kaixuanliu f4b12a7
use decorator to map kernels for functions
kaixuanliu 6a3e6f3
small fix
kaixuanliu 702dc09
small adjustment
kaixuanliu fdabd60
Merge branch 'main' into rotary-kernel
kaixuanliu fe2bf42
update code
kaixuanliu 6f95969
fix LINT issue
kaixuanliu 2ebea7b
Merge branch 'main' into rotary-kernel
kaixuanliu e80cfd3
update code to adapt to new `use_kernel_func_from_hub` API in kernels
kaixuanliu c771acd
do not consider check_modular first
kaixuanliu d916ef0
update
kaixuanliu 8670efe
fix
kaixuanliu fe20bd5
add compatibility for old version `kernels`
kaixuanliu 898e36e
add rotary fn kernel to all models
kaixuanliu b8b68c7
update modular part
kaixuanliu af25ce0
Revert "update modular part"
kaixuanliu 4b9da30
update code
kaixuanliu File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@MekkCyber we need some docs here on usage etc!