Grid reshape to increase hit rate #74

Closed
mmigdal-nv wants to merge 3 commits into NVIDIA:tracking-matmul from mmigdal-nv:grid_reshape

@mmigdal-nv
Collaborator

@mmigdal-nv mmigdal-nv commented Mar 27, 2023

Runtimes in ms:
Current tracking-matmul (SWIZZLE_FACTOR=1):

SWIZZLE_FACTOR=1 ./bin/nvfuser_tests --gtest_filter=*FusionAmpereMatmul_CUDA*
...
6.80659
5.72541
5.78003
5.336
5.7224
5.59946

With SWIZZLE_FACTOR=4:

SWIZZLE_FACTOR=4 ./bin/nvfuser_tests --gtest_filter=*FusionAmpereMatmul_CUDA*
...
5.14675
5.06474
5.08704
5.13798
5.12608
5.07923
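
To make the comparison easier to read, the two lists above can be summarized with a short script. This is only a sketch: it assumes the six values in each list are comparable measurements of the same set of tests, and the variable and function names (`baseline_ms`, `swizzled_ms`, `summarize`) are made up for illustration and are not part of nvfuser.

```python
from statistics import mean, median

# Runtimes in ms, copied from the two runs above.
baseline_ms = [6.80659, 5.72541, 5.78003, 5.336, 5.7224, 5.59946]    # SWIZZLE_FACTOR=1
swizzled_ms = [5.14675, 5.06474, 5.08704, 5.13798, 5.12608, 5.07923]  # SWIZZLE_FACTOR=4


def summarize(samples):
    """Return (mean, median, spread) of a list of runtimes in ms."""
    return mean(samples), median(samples), max(samples) - min(samples)


base_mean, base_median, base_spread = summarize(baseline_ms)
swz_mean, swz_median, swz_spread = summarize(swizzled_ms)

# Ratio > 1 means the SWIZZLE_FACTOR=4 run is faster on average.
speedup = base_mean / swz_mean
# Fraction of the baseline mean runtime that was saved.
reduction = 1.0 - swz_mean / base_mean

print(f"baseline: mean={base_mean:.3f} median={base_median:.3f} spread={base_spread:.3f}")
print(f"swizzled: mean={swz_mean:.3f} median={swz_median:.3f} spread={swz_spread:.3f}")
print(f"speedup={speedup:.3f}x, runtime reduction={reduction:.1%}")
```

The median is reported next to the mean because the first baseline value (6.80659) is noticeably higher than the rest and pulls the baseline mean up.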

@mmigdal-nv
Collaborator Author

Implemented in #87, so closing this one.

@mmigdal-nv mmigdal-nv closed this Mar 29, 2023
@mmigdal-nv mmigdal-nv deleted the grid_reshape branch May 4, 2023 22:55