
@Hzfengsy
Member

@Hzfengsy Hzfengsy commented Feb 1, 2023

This PR adds Tensor Core intrinsics with shared.dyn scope and changes the default rules to use shared.dyn.

Here is the performance improvement for a 1024x1024x1024 GEMM on my device (RTX-3080):

| | Use shared | Use shared.dyn | Speedup |
| --- | --- | --- | --- |
| fp 16-16-16 | 66399.8766 GFLOPs | 71778.3808 GFLOPs | 8.1% |
| fp 16-16-32 | 44292.5893 GFLOPs | 49070.2514 GFLOPs | 10.8% |
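
The speedup figures can be cross-checked from the two throughput figures in each row. The snippet below is a minimal sketch, assuming "Speedup" means the relative throughput gain of `shared.dyn` over `shared`; the names `results` and `speedup_percent` are illustrative and are not part of TVM.

```python
# Throughput figures (GFLOPs) copied from the reported results:
# (using shared, using shared.dyn)
results = {
    "fp 16-16-16": (66399.8766, 71778.3808),
    "fp 16-16-32": (44292.5893, 49070.2514),
}


def speedup_percent(baseline: float, improved: float) -> float:
    """Relative throughput gain of `improved` over `baseline`, in percent.

    Assumption: this is how the reported speedup was derived.
    """
    return (improved / baseline - 1.0) * 100.0


# Round to one decimal place, matching the precision of the reported numbers.
speedups = {
    name: round(speedup_percent(base, dyn), 1)
    for name, (base, dyn) in results.items()
}

for name, pct in speedups.items():
    print(f"{name}: {pct}%")
```

Under that assumption, the computed values agree with the reported 8.1% and 10.8%.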

cc @vinx13 @junrushao @masahi

This PR adds Tensor Core intrinsics with `shared.dyn` scope and changes
the default rules to use `shared.dyn`.
@tvm-bot
Collaborator

tvm-bot commented Feb 1, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@Hzfengsy Hzfengsy changed the title from "[MetaSchedule] Use shared.dyn for Tensor Core schedule rule" to "[MetaSchedule] Use shared.dyn for Tensor Core Schedule Rules" Feb 1, 2023
@junrushao junrushao merged commit 9bbc2c0 into apache:main Feb 1, 2023
@Hzfengsy Hzfengsy deleted the tensorcore_shared_dyn branch March 8, 2023 07:27