Enable sliding window in registers by dsharletg · Pull Request #5815 · halide/Halide

dsharletg · 2021-03-16T18:45:32Z

This PR enables an alternative strategy for sliding window (used when the producer of a sliding window is store_in(MemoryType::Register)). The alternative strategy works by shifting the producer such that the previously computed values are copied from the previous iteration, and the newly computed values are always computed in the same place. This avoids non-constant addresses, which are a prerequisite for storing things in registers (on most targets).

This mostly fixes #1820, though it currently requires bending over backwards in the schedule a bit for sliding vectorized dimensions.

@abadams contributed heavily to the ideas implemented here.

This PR has some other related changes:

Treat if_then_else intrinsics like select when partitioning loops.
After we're done with using boxes_touched, rebase some loops to have a min of 0, to enable stronger simplifications.
More simplifier rules.

…e-registers

…alization bounds.

…/Halide into dsharletg/slide-registers

dsharletg · 2021-03-22T15:28:05Z

Ping on a code review, please take a look at this when you get a chance @abadams. I've confirmed that this does address the use case we had in mind well.

abadams · 2021-03-23T19:08:03Z

src/RebaseLoopsToZero.cpp

+    switch (type) {
+    case ForType::Extern:
+    case ForType::GPUBlock:
+    case ForType::GPUThread:


gpu loops absolutely need to be rebased to zero, and there's a pass inside FuseGPUThreadLoops.cpp that does it. Is that mutator now redundant?

That mutator works a little differently. It substitutes loop_var + min, rather than makes a new let, and that seems like maybe something that might matter for another pass/logic (there's a lot of stuff happening in FuseGPUThreadLoops). I would also be careful about changing when loops get rebased to 0.

Maybe this PR should make it so rebase_loops_to_zero accepts a set of ForType that get rebased, and use that in FuseGPUThreadLoops? That would at least keep the rebasing happening at the same time, and the only change would be substitute vs. let.

I think the new mutator is just more correct than the old one. I guess the old one happens earlier though, so we can't just rely on the new one to do the mutation. Your proposal sounds good, but I have no strong feelings one way or the other.

I think the duplication here is minimal and there are non-minimal risks in messing with this, so I'll save it for a separate PR (will file an issue).

abadams · 2021-03-23T19:14:01Z

src/Simplify_LT.cpp

              rewrite(select(y, z, x + c0) < x + c1, y && (z < x + c1), c0 >= c1) ||
              rewrite(select(y, z, x + c0) < x + c1, !y || (z < x + c1), c0 < c1) ||

+              rewrite(c0 < select(x, c1, c2), select(x, fold(c0 < c1), fold(c0 < c2))) ||


These rules all formally verify ok

src/SlidingWindow.cpp

abadams · 2021-03-23T19:17:53Z

Generally lgtm. I think the substitute call is fishy though.

…e-registers

steven-johnson · 2021-03-24T16:28:57Z

The OSX failure is unrelated (will be fixed by #5841), should be good to land

dsharletg and others added 24 commits March 13, 2021 00:36

Sliding in registers

e81714e

Fix some failure cases.

89ef82a

Handle if_then_else in loop partitioning.

c8a3fb1

Add rebase_loops_to_zero pass.

d05c72b

Use select instead of if_then_else.

975d700

Merge branch 'master' of github.com:halide/Halide into dsharletg/slid…

85c0ab5

…e-registers

Add select comparison simplifications.

085ba48

Don't rewrite lets

411e0cb

Rebase producer loops of register slides to 0, and don't overwrite re…

ce56515

…alization bounds.

Add rules for ramp < broadcast

4c0a6c5

Put the likely on the old value instead of the new value.

f422129

New rules for comparing ramps and broadcasts

8466e4d

Merge branch 'dsharletg/slide-registers' of https://github.com/halide…

0f2c9e9

…/Halide into dsharletg/slide-registers

Switch back to if_then_else

f7111ca

Update comments.

16ad4e0

Don't try to fold dimensions with a constant min or max.

bc6a7c6

More comments.

944ab79

Make the vectorized register sliding window test tighter.

b94a59b

Remove debug helper.

70f9d7a

Fix tests broken by loop rebasing.

e54dd90

Move rebasing after loop partitioning

86b4fd6

clang-format

afe379d

clang-tidy

b68bdcf

Also put MemoryType::Register on the stack.

c142b77

steven-johnson requested a review from abadams March 18, 2021 22:44

Merge branch 'master' into dsharletg/slide-registers

2e1f91e

abadams reviewed Mar 23, 2021

View reviewed changes

src/SlidingWindow.cpp Outdated Show resolved Hide resolved

dsharletg added 2 commits March 23, 2021 13:52

Merge branch 'master' of github.com:halide/Halide into dsharletg/slid…

25b3997

…e-registers

Expand arg before substitute.

084aba3

abadams approved these changes Mar 23, 2021

View reviewed changes

dsharletg merged commit 92dfc82 into master Mar 24, 2021

dsharletg deleted the dsharletg/slide-registers branch March 24, 2021 17:57

alexreinking added this to the v12.0.0 milestone May 19, 2021

Bastacyclop mentioned this pull request Dec 20, 2021

Blur implementation #2905

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable sliding window in registers#5815

Enable sliding window in registers#5815
dsharletg merged 27 commits intomasterfrom
dsharletg/slide-registers

dsharletg commented Mar 16, 2021

Uh oh!

dsharletg commented Mar 22, 2021

Uh oh!

abadams Mar 23, 2021

Uh oh!

dsharletg Mar 23, 2021

Uh oh!

abadams Mar 23, 2021

Uh oh!

dsharletg Mar 24, 2021 •

edited

Loading

Uh oh!

abadams Mar 23, 2021

Uh oh!

Uh oh!

abadams commented Mar 23, 2021

Uh oh!

steven-johnson commented Mar 24, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

dsharletg commented Mar 16, 2021

Uh oh!

dsharletg commented Mar 22, 2021

Uh oh!

abadams Mar 23, 2021

Choose a reason for hiding this comment

Uh oh!

dsharletg Mar 23, 2021

Choose a reason for hiding this comment

Uh oh!

abadams Mar 23, 2021

Choose a reason for hiding this comment

Uh oh!

dsharletg Mar 24, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

abadams Mar 23, 2021

Choose a reason for hiding this comment

Uh oh!

Uh oh!

abadams commented Mar 23, 2021

Uh oh!

steven-johnson commented Mar 24, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

dsharletg Mar 24, 2021 •

edited

Loading