[AutoScheduler] Relay integration : Task extraction #6710

merrymercy · 2020-10-19T09:29:05Z

This pr implements the basic relay integration for auto-scheduler.

Approach

register auto-scheduler as an implementation in the OpStrategy. We have a universal schedule function for all topi compute functions.
Use tracing to extract all tasks similar to autotvm.

If auto-scheduler is enabled by auto_scheduler.enable_relay_integration, then auto-scheduler is always preferred to any other registered implementations.

Limitation

Ideally, we should be able to mix autotvm, auto-scheduler, and static libraries, then we can select the best implementation according to profiling records.
However, this is not trivial to implement, because auto-scheduler works on a subgraph level while the current OpStrategy is designed for single operator level.
The limitation of OpStrategy also makes auto-scheduler unable to support multiple compute declarations. For example, auto-scheduler cannot select direct vs. winograd based on profiling records, while autotvm can easily do this.

Minor todo

Add an API to disable the cache in relay compilation, so we can extract the weights (i.e., the number of appearances) of each task for the task-scheduler.
Clean up dispather.py

merrymercy · 2020-10-19T09:29:43Z

cc @icemelon9 @tqchen @minminsun @jcf94 @comaniac @FrozenGene

comaniac

Theoretically we need to add add_auto_schedular to all ops but we only have 2 right now. Is there any concern or plan?

python/tvm/auto_scheduler/relay_integration.py

python/tvm/auto_scheduler/dispatcher.py

tests/python/relay/test_auto_scheduler_task_extraction.py

masahi · 2020-10-24T03:53:31Z

python/tvm/auto_scheduler/relay_integration.py

+auto_schedule_impl_suffix = ".auto_scheduler"
+
+
+def auto_schedule_topi(outs):


Is it better to call it auto_schedule_te (and replace all topi in this file with te)? I imagine there are cases where inputs to this function don't come from topi, such as TE from gradient ops generated by Relay.

We cannot call it auto_schedule_te. We have another function auto_schedule (https://github.com/apache/incubator-tvm/blob/e59c603515befb02035e237794aa0645dbfbaf09/python/tvm/auto_scheduler/auto_schedule.py#L161) for your use case.
But this function is designed to be used as a TOPI schedule function for Relay, because it does a lot of other things.

python/tvm/auto_scheduler/dispatcher.py

tests/python/relay/test_auto_scheduler_task_extraction.py

comaniac

LGTM. Please fix the CI.

merrymercy · 2020-10-27T00:34:23Z

@comaniac I only added conv2d_nhwc and dense in this PR to test the functionality.
I will send another PR with more ops and a tutorial on tuning networks for CUDA.

FrozenGene · 2020-10-29T06:04:19Z

python/tvm/auto_scheduler/utils.py

 try:
    import psutil
 except ImportError:
-    raise ImportError("psutil not found, try `pip install psutil` to fix this")


I prefer keeping this ImportError. During internal development, we have met several times our environment doesn't import this package, we could tune but meet process error and find this root cause is hard.

FrozenGene · 2020-10-29T06:07:38Z

python/tvm/auto_scheduler/utils.py

 def kill_child_processes(parent_pid, sig=signal.SIGTERM):
    """kill all child processes recursively"""
+    if not psutil:
+        raise ImportError("psutil not found, try `pip install psutil` to fix this")


Ah...you move it here. But i am curious why we should need it here. For testing?

Yes, it is for testing. The goal is to minimize the dependency to run tvm.
If users do not use auto-scheduler, then they do not need to install this package.

tqchen · 2020-10-31T01:06:10Z

@merrymercy the code seems to cause test problem on MacOS, please keep tornado (rpc tracker) as an optional dep https://github.com/apache/incubator-tvm/actions/runs/338532340

merrymercy · 2020-10-31T01:47:11Z

Fixed by #6807

* add task extraction * fix evo search * fix tests * fix test * fix docstring * fix docstring * update workload registry * fix warning * fix test * fix fallback * fix lint * fix tests

merrymercy force-pushed the master branch 3 times, most recently from 7758f24 to 5d93c61 Compare October 19, 2020 09:44

comaniac requested changes Oct 19, 2020

View reviewed changes

ZihengJiang added the status: need review label Oct 21, 2020

masahi reviewed Oct 24, 2020

View reviewed changes

FrozenGene reviewed Oct 26, 2020

View reviewed changes

python/tvm/auto_scheduler/dispatcher.py Outdated Show resolved Hide resolved

tests/python/relay/test_auto_scheduler_task_extraction.py Outdated Show resolved Hide resolved

merrymercy mentioned this pull request Oct 26, 2020

[FIX,AUTOSCHEDULER] Fix auto_scheduler to run with multiprocessing's spawn start method #6671

Merged

merrymercy force-pushed the master branch from 5d93c61 to 8c60780 Compare October 26, 2020 13:19

comaniac approved these changes Oct 26, 2020

View reviewed changes

merrymercy force-pushed the master branch 2 times, most recently from df161b9 to ba6d2aa Compare October 27, 2020 02:44

FrozenGene reviewed Oct 29, 2020

View reviewed changes

merrymercy added 5 commits October 30, 2020 04:34

add task extraction

ce5d670

fix evo search

0164d51

fix tests

368519c

fix test

2e5b6ad

fix docstring

e01ab2f

merrymercy force-pushed the master branch from 7693587 to e01ab2f Compare October 30, 2020 04:35

merrymercy added 4 commits October 30, 2020 04:41

fix docstring

10e025b

update workload registry

e1a4abc

fix warning

c11a39d

fix test

7c0e18d

merrymercy force-pushed the master branch from d0708b1 to 7c0e18d Compare October 30, 2020 17:06

merrymercy added 3 commits October 30, 2020 17:17

fix fallback

8104ca8

fix lint

4bcdd6d

fix tests

d76bb48

merrymercy merged commit a261454 into apache:main Oct 30, 2020

comaniac added status: accepted and removed status: need review labels Nov 2, 2020

		auto_schedule_impl_suffix = ".auto_scheduler"


		def auto_schedule_topi(outs):

[AutoScheduler] Relay integration : Task extraction #6710

[AutoScheduler] Relay integration : Task extraction #6710

Uh oh!

Conversation

merrymercy commented Oct 19, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Approach

Limitation

Minor todo

Uh oh!

merrymercy commented Oct 19, 2020

Uh oh!

comaniac left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

masahi Oct 24, 2020

Choose a reason for hiding this comment

Uh oh!

merrymercy Oct 26, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

comaniac left a comment

Choose a reason for hiding this comment

Uh oh!

merrymercy commented Oct 27, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

FrozenGene Oct 29, 2020

Choose a reason for hiding this comment

Uh oh!

FrozenGene Oct 29, 2020

Choose a reason for hiding this comment

Uh oh!

merrymercy Oct 30, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tqchen commented Oct 31, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

merrymercy commented Oct 31, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

merrymercy commented Oct 19, 2020 •

edited

Loading

merrymercy Oct 26, 2020 •

edited

Loading

merrymercy commented Oct 27, 2020 •

edited

Loading

merrymercy Oct 30, 2020 •

edited

Loading

tqchen commented Oct 31, 2020 •

edited

Loading