-
Notifications
You must be signed in to change notification settings - Fork 79
add epilogue to store MMA results in shared memory before write to #387
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
36 commits
Select commit
Hold shift + click to select a range
379eab5
add epilogue to store MMA results in shared memory before write to
liqiangxl b500d36
revise test
liqiangxl acf1167
format
liqiangxl be885b0
swizzleSharedMemory
liqiangxl 7138cbe
format
liqiangxl fad4ad8
fix failed test cases
liqiangxl 84e3e98
propagate to epilogue tensors
liqiangxl a94e5df
check num_shared_mem_tensors
liqiangxl 9f4bcc4
format
liqiangxl 1760c15
disable_smem_epilogue
liqiangxl f0ff6f9
extend MatmulSASSTest
liqiangxl da5dc3a
schedule output tensor
liqiangxl 537b855
wip
liqiangxl ab86a1f
use propagate
liqiangxl f2a75cd
fix failed case
liqiangxl 7a4d5b5
fix ci fails by increasing tolerance:x
liqiangxl 86b8911
merge main
liqiangxl 5586b3a
fix failed cases
liqiangxl 6a8f139
trivial fix
liqiangxl d6212cb
format
liqiangxl 95ea553
revise hasEnoughSharedMemoryForEpilogue
liqiangxl 925e04d
merge main
liqiangxl 1f30a36
wip
liqiangxl 80d7588
cacheAfter mma_result
liqiangxl d3019f0
add epilogue cast and relu tests
liqiangxl 212258c
trivial fix
liqiangxl a2045cd
mma data types
liqiangxl 32f43d8
merge main
liqiangxl 67ecdb0
revise smem swizzle
liqiangxl 864a918
test with revised swizzle
liqiangxl 79c452c
merge main
liqiangxl 26d970d
save file
liqiangxl 189cef4
revise based on review comments
liqiangxl 889ce23
rename
liqiangxl 94aae9b
change default to false
liqiangxl 27870d0
Merge branch 'main' into llu/matmul_epilogue
liqiangxl File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
Large diffs are not rendered by default.
Oops, something went wrong.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.