[ETHOSN] Inline non-compute-intensive partitions #13092

lhutton1 · 2022-10-17T08:40:52Z

Adds a pass that analyzes functions partitioned for the NPU and inlines those that are deemed "non-compute-intensive" back to the main function so that they can be considered for other backends. The current heuristic for deciding a non-compute-intensive function is to collectively check all of the operations in the function have no multiply accumulate operations. This heuristic is not optimal; optimization is left for future exploration.

This pass is inspired by the "IsComputeIntensiveGraph" pass in the TensorRT integration.

cc @ashutosh-arm @leandron

tvm-bot · 2022-10-17T08:40:55Z

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

cc @ashutosh-arm _{See #10317 for details}
Built docs for commit bd0bf6c can be found here.

_{Generated by tvm-bot}

python/tvm/relay/op/contrib/ethosn.py

asparkhi · 2022-10-19T11:23:53Z

src/relay/backend/contrib/ethosn/inline_partitions.cc

+  /*! \brief Whether or not the partitioned function is consdiered compute intensive. */
+  bool is_compute_intensive;
+  /*! \brief A set of operators considered compute intensive. */
+  const std::unordered_set<std::string> compute_intensive_operators{


The pass looks extremely useful. In order to make this pass more generic, would it make sense to accept this list as an input to the pass? In case of npu, this can be passed from the python partitioner.

me overthinking 🤔 : maybe in future, this list can be accepted as a tvmc command line argument to make the pass even more generic?

I was a bit hesitant to make this pass more generic as different hardware might require a different heuristic all together (rather than just a different list of operators). That said, it does seem useful for the user to customise/tune the pass from TVMC if necessary, especially since the heuristic is not optimal. But yep, think this would be good as a followup

src/relay/backend/contrib/ethosn/inline_partitions.cc

asparkhi · 2022-10-19T11:36:10Z

src/relay/backend/contrib/ethosn/inline_partitions.cc

+    }
+
+    if (op_name != "") {
+      if (compute_intensive_operators.find(op_name) != compute_intensive_operators.end()) {


question: in this mechanism, a partitioned function containing lot of non compute intensive ops could be inlined too. is that intentional?

Yes currently that's possible. It's difficult to come up with a sensible limit here without being able to estimate the performance, perhaps if needed we can expose this option to the user in the future?

Adds a pass that analyzes functions partitioned for the NPU and inlines those that are deemed "non-compute-intensive" back to the main function so that they can be considered for other backends. The current heurisic for deciding a non-compute-intensive function is to collectively check all of the operations in the function have no multiply accumulate operations. This heuristic is not optimial; optimization is left for future exploration. This pass is inspired by the "IsComputeIntensiveGraph" pass in the TensorRT integration. Change-Id: I20c197702f5252f102cfc1e4b4635ab836aa7835

* 'inline_non_compute_intensive_partitions' -> 'is_inline_non_compute _intensive_partitions_enabled'. * remove no MAC operations. * fix network test. Change-Id: Ie1015b27f37e47544bed6f0aff819ee4649de579

Change-Id: I0ee0af071dc77c91e0ef0f6753506cb40d1d1859

Change-Id: Ie918d7f1059f032282f1f5eeffda38f4febcd59c

asparkhi

LGTM! Thanks @lhutton1 for making it as generic as possible. It can be used by many other targets with little modifications.

leandron

LGTM, thanks @lhutton1 @ashutosh-arm

* [ETHOSN] Inline non-compute-intensive partitions Adds a pass that analyzes functions partitioned for the NPU and inlines those that are deemed "non-compute-intensive" back to the main function so that they can be considered for other backends. The current heurisic for deciding a non-compute-intensive function is to collectively check all of the operations in the function have no multiply accumulate operations. This heuristic is not optimial; optimization is left for future exploration. This pass is inspired by the "IsComputeIntensiveGraph" pass in the TensorRT integration. Change-Id: I20c197702f5252f102cfc1e4b4635ab836aa7835 * Address comments * 'inline_non_compute_intensive_partitions' -> 'is_inline_non_compute _intensive_partitions_enabled'. * remove no MAC operations. * fix network test. Change-Id: Ie1015b27f37e47544bed6f0aff819ee4649de579 * Fix failing unit tests due to optimization Change-Id: I0ee0af071dc77c91e0ef0f6753506cb40d1d1859 * Add future exploration suggestions Change-Id: Ie918d7f1059f032282f1f5eeffda38f4febcd59c

github-actions bot requested a review from leandron October 17, 2022 08:41

areusch added needs-triage PRs or issues that need to be investigated by maintainers to find the right assignees to address it and removed needs-triage PRs or issues that need to be investigated by maintainers to find the right assignees to address it labels Oct 19, 2022

asparkhi reviewed Oct 19, 2022

View reviewed changes

lhutton1 force-pushed the unpartition-subgraphs branch from 4fa075e to 50eed63 Compare October 20, 2022 11:15

Address comments

3defac8

* 'inline_non_compute_intensive_partitions' -> 'is_inline_non_compute _intensive_partitions_enabled'. * remove no MAC operations. * fix network test. Change-Id: Ie1015b27f37e47544bed6f0aff819ee4649de579

lhutton1 force-pushed the unpartition-subgraphs branch from 50eed63 to 3defac8 Compare October 20, 2022 11:58

lhutton1 added 2 commits October 20, 2022 13:31

Fix failing unit tests due to optimization

ea1c740

Change-Id: I0ee0af071dc77c91e0ef0f6753506cb40d1d1859

Add future exploration suggestions

bd0bf6c

Change-Id: Ie918d7f1059f032282f1f5eeffda38f4febcd59c

asparkhi approved these changes Nov 3, 2022

View reviewed changes

leandron approved these changes Nov 3, 2022

View reviewed changes

leandron merged commit 75921fb into apache:main Nov 3, 2022

lhutton1 deleted the unpartition-subgraphs branch November 15, 2022 15:40

leandron mentioned this pull request Feb 1, 2023

TVM v0.11.0 Release Candidate Notes #13899

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ETHOSN] Inline non-compute-intensive partitions #13092

[ETHOSN] Inline non-compute-intensive partitions #13092

Uh oh!

lhutton1 commented Oct 17, 2022

Uh oh!

tvm-bot commented Oct 17, 2022 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

asparkhi Oct 19, 2022 •

edited

Loading

Uh oh!

asparkhi Oct 19, 2022

Uh oh!

lhutton1 Oct 20, 2022

Uh oh!

Uh oh!

asparkhi Oct 19, 2022

Uh oh!

lhutton1 Oct 20, 2022

Uh oh!

asparkhi left a comment •

edited

Loading

Uh oh!

leandron left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[ETHOSN] Inline non-compute-intensive partitions #13092

[ETHOSN] Inline non-compute-intensive partitions #13092

Uh oh!

Conversation

lhutton1 commented Oct 17, 2022

Uh oh!

tvm-bot commented Oct 17, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

asparkhi Oct 19, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

asparkhi Oct 19, 2022

Choose a reason for hiding this comment

Uh oh!

lhutton1 Oct 20, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

asparkhi Oct 19, 2022

Choose a reason for hiding this comment

Uh oh!

lhutton1 Oct 20, 2022

Choose a reason for hiding this comment

Uh oh!

asparkhi left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

leandron left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

tvm-bot commented Oct 17, 2022 •

edited

Loading

asparkhi Oct 19, 2022 •

edited

Loading

asparkhi left a comment •

edited

Loading