Dispatch combine #571
Merged
valarLip approved these changes on Jul 2, 2025
fsx950223 pushed a commit that referenced this pull request on Jul 2, 2025

* add dispatch combine
* fix dispatch combine bug
* add moe ep to test
* format
* ci
* remove ck_moe
* fix prebuild
* update ck
* update test for mori update
* update
* clear code
* update test
* add quant argument for test

---------

Co-authored-by: valarLip <340077269@qq.com>
Co-authored-by: TianDi101 <Di.Tian2@amd.com>
Co-authored-by: amd-ruitang3 <Rui.Tang2@amd.com>
valarLip added a commit that referenced this pull request on Jul 2, 2025

* refresh pa rocm
* optimize performance
* remove useless deps
* adapt for gfx950 draft
* pass v cache one test
* fix mtp bugs
* remove print
* format code
* fix a bug
* support vary query length
* remove torch deps
* enable pa mtp
* update api
* remove useless template arg
* optimize performance
* revert changes
* fix format
* revert change
* revert change
* support head dim 256
* fix ci error
* fix unit test
* fix bugs
* remove descrpted api
* remove comments
* fix api
* fix alibi slopes
* fix polential bug
* support gqa32
* strip archs
* add pa v1 api
* update pa
* add paged attention v
* optimize performance
* fix fp8 vlds
* add debug log
* add env
* add logs to utils
* fix pa v1 fp8 accuracy issue
* optimize performance a little
* optimize perf a little
* fix a bug
* fix workspace bug
* refactor pa ragged
* fix format
* remove useless file
* remove useless file
* remove useless code
* robust utils
* add rocm6.4 support
* fix type hint
* fix pa rocm unit test
* adapt to latest triton
* Dispatch combine (#571)
  * add dispatch combine
  * fix dispatch combine bug
  * add moe ep to test
  * format
  * ci
  * remove ck_moe
  * fix prebuild
  * update ck
  * update test for mori update
  * update
  * clear code
  * update test
  * add quant argument for test

  ---------

  Co-authored-by: valarLip <340077269@qq.com>
  Co-authored-by: TianDi101 <Di.Tian2@amd.com>
  Co-authored-by: amd-ruitang3 <Rui.Tang2@amd.com>

---------

Co-authored-by: chenjun <46212055+junhaha666@users.noreply.github.com>
Co-authored-by: valarLip <340077269@qq.com>
Co-authored-by: TianDi101 <Di.Tian2@amd.com>
Co-authored-by: amd-ruitang3 <Rui.Tang2@amd.com>
No description provided.