ATen scheduler for the new Matmul/LinearOp IR nodes#2209

Merged
Priya2698 merged 35 commits into main from pm/aten_scheduler on May 14, 2024
Conversation

Collaborator

@Priya2698 Priya2698 commented May 7, 2024

  1. Adds a new scheduler, ExprEvalScheduler, that accepts the MatmulOp and LinearOp (next PR) for ATen evaluation.
  2. Modifies the matmul input generator to test all cases supported by Thunder.
  3. Renames the eagerMatmul API to replace the existing matmul API. fd.ops.matmul now creates a MatmulOp (except in a few special cases, such as the scalar dot product, e.g. [K] x [K]).

Issue #2149, #2092.
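Point 3 above can be sketched as a shape predicate. This is a minimal illustration, not nvFuser code; the helper name is invented:

```python
def lowers_to_matmul_op(a_shape, b_shape):
    # Hypothetical predicate mirroring the PR description: fd.ops.matmul
    # creates a MatmulOp except for the scalar dot-product case [K] x [K],
    # which is handled specially and does not produce a MatmulOp node.
    return not (len(a_shape) == 1 and len(b_shape) == 1)

print(lowers_to_matmul_op((32,), (32,)))      # scalar dot product: special case
print(lowers_to_matmul_op((16, 32), (32, 8)))  # regular 2-D matmul: MatmulOp
```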

Collaborator

@jacobhinkle jacobhinkle left a comment


Looking great. Sprinkle in a few tests once you merge #2175 and we'll be on our way.

@Priya2698 Priya2698 force-pushed the pm/aten_scheduler branch 2 times, most recently from 2a2786e to 314ab22 on May 9, 2024 22:30
@Priya2698 Priya2698 force-pushed the pm/aten_scheduler branch from 314ab22 to 1ad32ea on May 9, 2024 22:37
@Priya2698 Priya2698 marked this pull request as ready for review May 13, 2024 23:57
@Priya2698
Collaborator Author

!build

Collaborator

@jacobhinkle jacobhinkle left a comment


Looks good overall. I am just slightly concerned about removing stuff from PairwiseRootDomainMap::map.

 //! Define a schedule table to loop over all the heuristics in priority order.
-constexpr std::array<ScheduleHeuristic, 8> all_heuristics_in_priority_order = {
+constexpr std::array<ScheduleHeuristic, 9> all_heuristics_in_priority_order = {
+    ScheduleHeuristic::ExprEval,
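Why this ordering matters: the segmenter walks the table and the first scheduler whose canSchedule check accepts the fusion claims it. A toy model of that first-match dispatch (scheduler names and predicates are stand-ins, not nvFuser code):

```python
def pick_heuristic(fusion, schedulers):
    # First-match dispatch over a priority-ordered table: each entry is
    # (name, can_schedule predicate); the first accepting scheduler wins.
    for name, can_schedule in schedulers:
        if can_schedule(fusion):
            return name
    return None

# A matmul fusion whose output dims are all broadcast-like would be
# claimed by whichever of ExprEval/NoOp appears first in the table.
schedulers = [
    ("ExprEval", lambda f: f.get("has_matmul", False)),
    ("NoOp", lambda f: f.get("all_output_dims_broadcast", False)),
]
fusion = {"has_matmul": True, "all_output_dims_broadcast": True}
print(pick_heuristic(fusion, schedulers))
```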
Collaborator


Should NoOp come before ExprEval?

Collaborator Author


Some cases get accepted by NoOp scheduler, which is why I prioritized ExprEval scheduler.

We may need to change the heuristics of NoOp if we want to switch the order.

Collaborator

@jacobhinkle jacobhinkle May 14, 2024


Oh! Thanks for mentioning that. Does NoOp scheduler accept the cases where you have a single scalar output? Because it seems to me that it would do so based on this code:

Fuser/csrc/fusion.cpp

Lines 341 to 359 in 8c18701

bool Fusion::isNoOp() {
  if (exprs().empty()) {
    return true;
  }
  for (auto out_tv : ir_utils::filterByType<TensorView>(outputs())) {
    const std::vector<IterDomain*>& root_dom =
        TensorDomain::noReductions(out_tv->getMaybeRFactorDomain());
    const bool size_zero =
        std::any_of(root_dom.begin(), root_dom.end(), [](IterDomain* id) {
          return id->extent()->isConstScalar() && id->extent()->evaluate() == 0;
        });
    if (!size_zero) {
      return false;
    }
  }
  return true;
}
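The root_dom.empty() observation can be checked with a toy model of that loop (extents as plain ints; simplified, not the real class):

```python
def size_zero(root_dom):
    # Mirrors the std::any_of call above: any() over an empty sequence is
    # False, so a 0-dim output (empty root domain) is never size-zero.
    return any(extent == 0 for extent in root_dom)

def is_no_op(output_root_domains):
    # Simplified model of Fusion::isNoOp: every output must have a
    # size-zero dimension (the empty-exprs early return is omitted).
    return all(size_zero(rd) for rd in output_root_domains)

print(is_no_op([[]]))      # zero-dimensional output: not a no-op here
print(is_no_op([[0, 4]]))  # output with a size-0 dimension: no-op
```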

We should add a special case for zero-dimensional outputs there. On second look, it seems like size_zero would be false in the case that root_dom.empty(). However, the code below might not properly handle zero-dimensional outputs:
// Check that all outputs are either broadcast or ignored reduction.
for (auto out_tv : ir_utils::filterByType<TensorView>(fusion->outputs())) {
  auto concrete_dimension = TensorDomain::noReductions(
      TensorDomain::noBroadcasts(out_tv->getLeafDomain()));
  if (!concrete_dimension.empty()) {
    scheduler_debug_utils::canScheduleRejectReason(
        heuristicType(), "output has a concrete dimension");
    return false;
  }
}
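A toy version of that output check (leaf-domain kinds modeled as strings; names invented for illustration) shows why a zero-dimensional or all-broadcast output slips through:

```python
def noop_accepts_outputs(outputs):
    # Each output is a list of leaf-domain kinds. Only "broadcast" and
    # "reduction" domains are filtered out before the emptiness test, so
    # an empty (0-dim) or all-broadcast output leaves nothing concrete.
    for leaf_domain in outputs:
        concrete = [d for d in leaf_domain
                    if d not in ("broadcast", "reduction")]
        if concrete:
            return False  # "output has a concrete dimension"
    return True

print(noop_accepts_outputs([[]]))                         # 0-dim output
print(noop_accepts_outputs([["broadcast", "broadcast"]]))  # e.g. [1, 1]
print(noop_accepts_outputs([["iter", "broadcast"]]))       # e.g. [M, 1]
```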

Collaborator Author


[  FAILED  ] 6 tests, listed below:
[  FAILED  ] ATenNodesParametrizedTest.MatmulNodeConcrete/2, where GetParam() = ({ 32 }, { 32, 1 })
[  FAILED  ] ATenNodesParametrizedTest.MatmulNodeConcrete/8, where GetParam() = ({ 1, 32 }, { 32 })
[  FAILED  ] ATenNodesParametrizedTest.MatmulNodeConcrete/10, where GetParam() = ({ 1, 32 }, { 32, 1 })
[  FAILED  ] ATenNodesParametrizedTest.MatmulNodeSymbolic/2, where GetParam() = ({ 32 }, { 32, 1 })
[  FAILED  ] ATenNodesParametrizedTest.MatmulNodeSymbolic/8, where GetParam() = ({ 1, 32 }, { 32 })
[  FAILED  ] ATenNodesParametrizedTest.MatmulNodeSymbolic/10, where GetParam() = ({ 1, 32 }, { 32, 1 })

It is likely because no reductions are identified since we use ATen, and all the dimensions in the output are broadcast dimensions. So the cases where M/N = 1 get picked up by the NoOp scheduler.
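The failing parameters fit that explanation: each produces an output whose every dimension has extent 1. A quick shape check, following torch.matmul semantics for the 1-D/2-D cases (helper names invented):

```python
def matmul_output_shape(a, b):
    # 1-D/2-D matmul shape rules, per torch.matmul semantics.
    if len(a) == 1 and len(b) == 1:
        return ()               # [K] x [K] -> scalar dot product
    if len(a) == 1:
        return b[1:]            # [K] x [K, N] -> [N]
    if len(b) == 1:
        return a[:-1]           # [M, K] x [K] -> [M]
    return a[:-1] + b[1:]       # [M, K] x [K, N] -> [M, N]

def all_broadcast(shape):
    # With ATen evaluation no reduction domain remains, so a size-1
    # output dimension is indistinguishable from a broadcast.
    return all(s == 1 for s in shape)

for a, b in [((32,), (32, 1)), ((1, 32), (32,)), ((1, 32), (32, 1))]:
    print(a, b, "->", matmul_output_shape(a, b))  # all extent-1 outputs
```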

Collaborator

@jacobhinkle jacobhinkle left a comment


LGTM after tests pass and you add some broadcasts in test.

@Priya2698
Collaborator Author

!build

@Priya2698 Priya2698 merged commit dfba77a into main May 14, 2024
@Priya2698 Priya2698 deleted the pm/aten_scheduler branch May 14, 2024 02:42
@jjsjann123
Collaborator

🚀

3 participants