Vectorization cleanup by zasdfgbnm · Pull Request #393 · NVIDIA/Fuser

zasdfgbnm · 2023-05-23T17:19:54Z

Currently, vectorization analysis is done by first calling getInnerDimVectorizableWidth to compute teh vectorization factor of the innermost dimension, then expand that to contiguous merge of getMaxVectorizableWidth. However, these two functions are very similar, almost a copy-paste of each other. Therefore, it makes no sense to use getInnerDimVectorizableWidth then getMaxVectorizableWidth, we should remove getInnerDimVectorizableWidth and go directly to getMaxVectorizableWidth.

This PR removes getInnerDimVectorizableWidth, anded a boolean option contig_merge to getMaxVectorizableWidth to reproduce the behavior of getInnerDimVectorizableWidth. The contig_merge=false is only used in the transpose scheduler, where the which dimensions are contiguously merge to which ID is more complicated and is not considered in heuristics.

Unfortunately, because the variable name change, I can not diff the kernel to check if they match exactly or not, but I do briefly skimmed through the diff by eye, and I don't find any change in the vectorization factor.

The V100 failure is OOM.

This reverts commit 5be2421.

This reverts commit f498f3f.

This reverts commit 2010de6.

zasdfgbnm · 2023-05-23T17:24:37Z

csrc/scheduler/vectorize_helper.cpp

  return vectorize_size;
 }

-size_t getExpandedVectorization(


A big portion of getVectorizationFactor has been removed, so I just cut-paste the function body of this function into getVectorizationFactor and remove this function.

…anup

zasdfgbnm · 2023-05-24T18:28:49Z

!build

zasdfgbnm · 2023-05-24T19:59:53Z

!build

csrc/scheduler/vectorize_helper.cpp

zasdfgbnm · 2023-05-25T14:58:08Z

!build

naoyam

The changes make sense to me. Have you found any change in actual vectorization?

csrc/scheduler/vectorize_helper.cpp

zasdfgbnm · 2023-05-26T03:21:37Z

The changes make sense to me. Have you found any change in actual vectorization?

No, I didn't find any. But I didn't check programmatically. I just used my eye to inspect the output of

difft --language cpp ../Fuser3/old-kernels/ new-kernels/ | grep load

and didn't find any.

zasdfgbnm added 8 commits May 22, 2023 16:13

Allocation domain support in cacheFork

5be2421

TensorArgAbstract allocation size

2010de6

registry

f498f3f

Revert "Allocation domain support in cacheFork"

5dd9844

This reverts commit 5be2421.

cleanup vectorize

3340235

Revert "registry"

dd83215

This reverts commit f498f3f.

Revert "TensorArgAbstract allocation size"

91bc0c1

This reverts commit 2010de6.

format

6fcbbbc

zasdfgbnm commented May 23, 2023

View reviewed changes

zasdfgbnm marked this pull request as draft May 23, 2023 17:25

jacobhinkle mentioned this pull request May 23, 2023

Fix getPointwiseHeuristics with empty tensors #369

Closed

zasdfgbnm added 4 commits May 24, 2023 10:52

Merge branch 'main' of github.com:NVIDIA/Fuser into vectorization-cle…

499777b

…anup

transpose fix

3b92829

fix test

b740de2

doc

a4099e5

more vectorization

344ef49

zasdfgbnm marked this pull request as ready for review May 24, 2023 21:06

zasdfgbnm requested a review from naoyam May 24, 2023 21:08

Merge branch 'main' into vectorization-cleanup

35d31df

zasdfgbnm commented May 25, 2023

View reviewed changes

csrc/scheduler/vectorize_helper.cpp Show resolved Hide resolved

naoyam approved these changes May 26, 2023

View reviewed changes

csrc/scheduler/vectorize_helper.cpp Show resolved Hide resolved

Merge branch 'main' into vectorization-cleanup

f4e6ade

zasdfgbnm merged commit 88a782b into main May 26, 2023

zasdfgbnm deleted the vectorization-cleanup branch May 26, 2023 04:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Vectorization cleanup#393

Vectorization cleanup#393
zasdfgbnm merged 15 commits intomainfrom
vectorization-cleanup

zasdfgbnm commented May 23, 2023 •

edited

Loading

Uh oh!

zasdfgbnm May 23, 2023

Uh oh!

zasdfgbnm commented May 24, 2023

Uh oh!

zasdfgbnm commented May 24, 2023

Uh oh!

Uh oh!

zasdfgbnm commented May 25, 2023

Uh oh!

naoyam left a comment

Uh oh!

Uh oh!

zasdfgbnm commented May 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

zasdfgbnm commented May 23, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

zasdfgbnm May 23, 2023

Choose a reason for hiding this comment

Uh oh!

zasdfgbnm commented May 24, 2023

Uh oh!

zasdfgbnm commented May 24, 2023

Uh oh!

Uh oh!

zasdfgbnm commented May 25, 2023

Uh oh!

naoyam left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

zasdfgbnm commented May 26, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

zasdfgbnm commented May 23, 2023 •

edited

Loading