Merged
This was referenced Feb 19, 2024
Author (Collaborator) commented: `!build`
wujingyue approved these changes on Feb 19, 2024
Author (Collaborator) commented: `!build`
Author (Collaborator) commented: Failing test isn't related. Merging as-is.
jjsjann123 added a commit that referenced this pull request on Feb 21, 2024

Stacked PRs:
==== #1755 enabling layout propagation through runtime <- **_this one_**
#1792 propagation rule for broadcast
#1790 propagation rule for binary op
#1788 adding layout inference pass

What's in this PR:
Enabling the MemoryFormat optimization pass in runtime. The pass is run as part of the pre_segment optimization pass.
Adding a cpp test to verify the optimization behavior.

Quick design doc: #1756

Co-authored-by: Jacob Hinkle <1454944+jacobhinkle@users.noreply.github.com>
Co-authored-by: Jingyue Wu <wujingyue@gmail.com>
jjsjann123 added a commit that referenced this pull request on Feb 21, 2024

Stacked PRs:
#1755 enabling layout propagation through runtime
#1792 propagation rule for broadcast
==== #1790 propagation rule for binary op **_<- this one_**
#1788 adding layout inference pass

What's in this PR:
BinaryOp propagation tries to merge the allocation orders of both inputs:
* when only one operand is a tensor, we simply forward its recorded allocation order;
* when both operands are tensors, we resolve the conflict by: i. prioritizing the tensor with fewer broadcast iterdomains; ii. otherwise, propagating the allocation order of the lhs.

Propagation rule for binary operation:
- [x] adding propagate rule for binary op;
- [x] handling two scalars;
- [x] handling intermediate tensors (factory tensors);
- [x] adding cpp test to verify propagation rule;

Co-authored-by: Jacob Hinkle <1454944+jacobhinkle@users.noreply.github.com>
Co-authored-by: Jingyue Wu <wujingyue@gmail.com>
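The binary-op resolution described above can be sketched as follows. This is a minimal illustrative model in Python, not nvFuser's C++ implementation: the function name and the encoding of an operand as a `(alloc_order, is_broadcast)` tuple (with `None` standing in for a scalar) are assumptions made for this sketch.

```python
def propagate_binary(lhs, rhs):
    """Resolve the output allocation order of a binary op (illustrative model).

    Each operand is None for a scalar, or a tuple (alloc_order, is_broadcast):
    alloc_order is a permutation of dimension indices (outermost first), and
    is_broadcast flags which iterdomains are broadcasts.
    """
    if lhs is None:     # only rhs is a tensor (or both are scalars):
        return rhs      # forward rhs; None when both operands are scalars
    if rhs is None:     # only lhs is a tensor:
        return lhs      # forward its recorded allocation order
    # Both operands are tensors: prioritize the one with fewer
    # broadcast iterdomains.
    if sum(rhs[1]) < sum(lhs[1]):
        return rhs
    return lhs          # otherwise (including ties) propagate the lhs order
```

For example, adding a `(broadcast, concrete)` tensor to a fully concrete one picks the concrete operand's order, while a tie falls back to the lhs.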
jjsjann123 added a commit that referenced this pull request on Feb 21, 2024

Stacked PRs:
#1755 enabling layout propagation through runtime
==== #1792 propagation rule for broadcast **_<- this one_**
#1790 propagation rule for binary op
#1788 adding layout inference pass

What's in this PR:
BroadcastOp propagation tries to push all new broadcast iterdomains as outer dimensions for the output tensor.
- [x] adding propagate rule for broadcast op;
- [x] adding cpp test to verify propagation rule;

Co-authored-by: Jacob Hinkle <1454944+jacobhinkle@users.noreply.github.com>
Co-authored-by: Jingyue Wu <wujingyue@gmail.com>
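The broadcast rule amounts to a small permutation computation: new broadcast iterdomains become the outermost output dimensions, and the surviving input dimensions follow in the input's recorded order. Below is a hypothetical Python sketch of that rule, not the actual C++ pass; representing an allocation order as a tuple of dimension indices (outermost first) is an assumption of this sketch.

```python
def propagate_broadcast(in_order, is_broadcast):
    """Output allocation order for a BroadcastOp (illustrative model).

    in_order: input allocation order, a permutation of input dimension
    indices, outermost first.
    is_broadcast: one flag per *output* dimension, True where the op
    inserts a new broadcast iterdomain.
    """
    # Output positions of the newly inserted broadcast iterdomains.
    new_bcast = [i for i, b in enumerate(is_broadcast) if b]
    # out_pos[d] = output position of input dimension d.
    out_pos = [i for i, b in enumerate(is_broadcast) if not b]
    # New broadcasts go outermost; mapped input dims keep the input order.
    return tuple(new_bcast) + tuple(out_pos[d] for d in in_order)
```

For instance, broadcasting a 2-D input with allocation order `(1, 0)` into the middle of a 3-D output (`is_broadcast = [False, True, False]`) yields `(1, 2, 0)`: the new broadcast dim 1 is outermost, then the input's order.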
Stacked PRs:
#1755 enabling layout propagation through runtime
#1792 propagation rule for broadcast
#1790 propagation rule for binary op
==== #1788 adding layout inference pass <- this one
What's in this PR:
inferenceAllocationOrder pass that works on an entire Fusion:
It computes AllocationOrder on inputs by looking at each TensorView's allocation_domain and rfactor_domain;
It uses a predefined rule (in AllocationOrderInferencer) to traverse and propagate AllocationOrder from inputs to the entire fusion;
Note that the pass itself doesn't mutate the fusion IR; it's a utility function that suggests how allocation domains could be specified, for other optimization passes to apply.
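The input-side step above can be modeled as reading off the permutation that maps the rfactor domain to the allocation domain. This is a hypothetical Python sketch, not the real `inferenceAllocationOrder` code; modeling a domain as a list of iterdomain names is an assumption made for illustration.

```python
def allocation_order(rfactor_domain, allocation_domain):
    """Allocation order of a TensorView (illustrative model): the position
    of each allocation-domain iterdomain within the rfactor domain,
    outermost first."""
    return tuple(rfactor_domain.index(d) for d in allocation_domain)

# A channels-last (NHWC) input: rfactor domain is NCHW, allocation
# domain stores C innermost, giving allocation order (0, 2, 3, 1).
order = allocation_order(["N", "C", "H", "W"], ["N", "H", "W", "C"])
```

Downstream, an `AllocationOrderInferencer`-style traversal would then propagate such orders from inputs through the rest of the fusion.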
Quick design doc: #1756
Future Work: