[microTVM] Use autotuning to choose num_outputs value

Once #13242 lands, the convolution schedules for microTVM will have a tunable parameter `num_outputs`:

https://github.com/apache/tvm/blob/f11243a0eec0bae9a8f9b4a6d4ad6152de4e3fc9/python/tvm/topi/arm_cpu/qnn.py#L235-L239

As the comments states, picking this value is important for performance. It would be awesome to be able to autotune this - the correct value is very dependent on the exact parameters of the convolution, picking correctly will have a >10% impact on performance, and predicting it without autotuning would be challenging (though theoretically possible).

Note that this value is used in the _compute function_, not in the _scheduling function_, which makes autotuning harder.

cc @alanmacd @gromero @leandron @mehrdadh

	# Decide how many sums our function should have running at the same time. Doing
	# this lets us do "more work" for each memory load, but doing too many of them causes us to run
	# out of registers. Currently this is set to either 1 or 2, but autotuning this value would
	# improve performance a lot.
	num_sums = 2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[microTVM] Use autotuning to choose num_outputs value #13528

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[microTVM] Use autotuning to choose num_outputs value #13528

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions