Resolve broadcasts resulting from a PadOp by jacobhinkle · Pull Request #610 · NVIDIA/Fuser

jacobhinkle · 2023-07-18T17:47:26Z

This changes broadcast analysis to drop the assumption that all broadcasts are created through a BroadcastOp. This change allows a broadcast to be introduced between root and rfactor of any tensor, which happens if we pad an Iteration domain with negative pad widths, resulting in a Broadcast domain.

This adds a map_symbolic option to PairwiseRootDomainMap, which defaults to false and has the following effect (condition 5):

When map_symbolic == false, symbolic IterDomains are only mapped with one another if their extent expressions match (sameAs). This is the default mode and safely handles cases where one symbolic value will concretize to broadcast that is resolved by the other.
When map_symbolic == true, symbolic IterDomains are mapped regardless of their extent expressions. This mode is useful for producer to consumer mappings when we know we are not resolving a broadcast, but a placeholder extent expression might be used in the consumer. It is only used for propagating concretizations from producer to consumer, in which case we need to exact map aligned symbolic IDs.

Fixes #596. Fixes #357

jacobhinkle · 2023-07-18T17:48:51Z

Dynamic test is currently failing in ExpressionEvaluator::propagateBoundValuesThroughExactMaps due to #357 which we should probably finally address.

This prevents us overwriting a concretization to Broadcast with a concretization to Iteration.

test/test_resize.cpp

…asts

csrc/dynamic_transform.cpp

jacobhinkle · 2023-07-24T13:46:59Z

csrc/root_domain_map.cpp

+    // Condition 5
+    // At least one ID is symbolic.
+    // Map producer to consumer if and only if their extents are identical
+    if ((producer_id->isSymbolic() || consumer_id->isSymbolic()) &&
+        (!producer_id->extent()->sameAs(consumer_id->extent()))) {
+      itc++;
+      itp++;
+      continue;
+    }


This replaces earlier iterations on handling exact mapping of Symbolic domains, which I think were more cumbersome. The meaning of a Symbolic IterType is that it could potentially be a Broadcast domain. This is essentially reflecting that the extent expression hasn't been evaluated yet. If we have identical extents for two aligned Symbolic IterDomains, this should mean they will have the same IterType after concretization. In that case, they should be exact mapped. This lets us propagate bound extents in ExpressionEvaluator much more easily, while still avoiding exact mapping between transformed symbolic domains.

…asts

jacobhinkle · 2023-07-25T12:25:51Z

csrc/dynamic_transform.cpp

+  std::vector<std::unordered_map<IterDomain*, IterDomain*>> c2p_maps;
+  for (auto producer : ir_utils::filterByType<TensorView>(def->inputs())) {
+    PairwiseRootDomainMap root_map(producer, consumer);
+    c2p_maps.push_back(
+        root_map.mapConsumerToProducer(consumer->domain(), producer->domain()));
+  }


Precomputing c2p maps so they don't need to be recomputed for each consumer ID/producer pair.

This reverts commit 012e878.

Use it in ExpressionEvaluator differently than in concretization propagateP2C.

csrc/device_lower/analysis/trivial_broadcast.cpp

csrc/root_domain_map.h

csrc/dynamic_transform.cpp

test/test_dynamic_transform.cpp

…asts

jacobhinkle · 2023-09-26T13:34:34Z

!build

naoyam

LGTM. Thanks for the fix!

) Stacked on #610; see #876 (comment). This PR: - Changes the `analyzeResizes` pass in concretization to inspect expanded extents - Changes root->rfactor propagation in concretization to return early when IterType is already concretized. This avoids propagating `Broadcast` in this test when we have already marked the resized ID as `Iteration`. - Changes `PrecomputedValues::bindInputs` to bind not only metadata but also the actual `TensorView` arguments. I noticed that the `ExpressionEvaluator` used during compilation contained more bound scalars than the one used at execution where we fail to evaluate the extent. We had `i0` and `i2` bound at execution, but we did not have `T0` bound, so we could not compute `getMetaData(T0)`. At compilation, `T0` was bound so there was no problem until execution. Note that at compilation, we use `auto expr_eval = executor_utils::bindInputs(args, kernel);` whereas at compilation we use `evaluatorPrecomputedValues()->bindInputs(args);`. The difference is that `PrecomputedValues::bindInputs` will call `bindTensorMetaData` instead of binding the actual tensor. This PR also binds the actual tensor in addition to its metadata in that method. Fixes #870 --------- Co-authored-by: Gao, Xiang <qasdfgtyuiop@gmail.com>

jacobhinkle added 4 commits July 18, 2023 13:30

Add tests for padding to broadcast in various ways

e325d99

Set bcast consumer IDs with no bcast producers as origins

8d30c20

Only map root->rfactor IDs with same IterType in ca root dom map builder

e62d692

Switch to more challenging static pad to broadcast test

18cc3d1

jacobhinkle and others added 5 commits July 18, 2023 13:48

Merge branch 'main' into resolve_resize_broadcasts

af340f4

Remove commented code in static test

bed1358

Always skip mapping symbolic IterDomains that have different extents

ea05f07

Use maybeMutated id for isSymbolic check.

3fcafcb

This prevents us overwriting a concretization to Broadcast with a concretization to Iteration.

Merge branch 'main' into resolve_resize_broadcasts

78d8ffc

jacobhinkle commented Jul 19, 2023

View reviewed changes

test/test_resize.cpp Show resolved Hide resolved

jacobhinkle added 2 commits July 21, 2023 08:12

Merge remote-tracking branch 'origin/main' into resolve_resize_broadc…

5d0b6c5

…asts

Merge remote-tracking branch 'origin/main' into resolve_resize_broadc…

77910f0

…asts

jacobhinkle mentioned this pull request Jul 24, 2023

[WIP] Pad to broadcast #595

Closed

Update comment for isSymbolic check in mutate(TV)

e84ffe6

jacobhinkle commented Jul 24, 2023

View reviewed changes

csrc/dynamic_transform.cpp Show resolved Hide resolved

jacobhinkle commented Jul 24, 2023

View reviewed changes

jacobhinkle added 7 commits July 24, 2023 09:48

Add comment to #596 test

c6410b4

Expand comment about condition 5

b877331

Add comment to itertype check in ComputeAtRootDomainMapBuilder

dc3a156

Map Symbolic with non-Broadcast in propagateFromP2C

6020cd6

Rename test

9b48a6f

Merge remote-tracking branch 'origin/main' into resolve_resize_broadc…

6e2945a

…asts

Remove mapSymbolicNonBroadcast option. On always now

012e878

jacobhinkle commented Jul 25, 2023

View reviewed changes

jacobhinkle added 5 commits July 25, 2023 08:29

Revert "Remove mapSymbolicNonBroadcast option. On always now"

1aca3af

This reverts commit 012e878.

Update mapSymbolic option. Expose it to ExactRootDomainMap

eeff747

Use it in ExpressionEvaluator differently than in concretization propagateP2C.

Fix DynamicTransform4_CUDA, add long comment

3e974ba

Fix DynamicTransform3_CUDA

dbb7157

Fix DynamicTransform1_CUDA

a5f2543

jacobhinkle marked this pull request as ready for review September 13, 2023 16:17

jacobhinkle added 2 commits September 13, 2023 12:21

Register concretization from unmutated ID in root->rfactor

7ff24e6

Clean up comment in condition 5

1fe841a

jacobhinkle requested a review from naoyam September 13, 2023 16:29