!build
wujingyue left a comment
LGTM with comments! Make sure you address Jacob's as well.
csrc/ir/utils.cpp
Outdated
// 2. The inputs to MmaOp are broadcasted as the last dim for the first operand
// and the first dim for the second operand.
// The inputs of MmaOp will be [M, K, 1] x [1, K, N].
// Additionally, the inputs to the MmaOp should be of the `expected_input_dtype`.
For posterity, can you explain why this is required?
There is a comment on line 1351 reasoning about the dtypes. Do you want to move it upwards?
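For context, the broadcast layout in the quoted comment can be sketched in NumPy: broadcasting the operands to [M, K, 1] and [1, K, N] and reducing over K reproduces a plain matmul. This is an illustrative sketch, not nvFuser code:

```python
import numpy as np

M, K, N = 4, 8, 3
a = np.random.rand(M, K).astype(np.float32)
b = np.random.rand(K, N).astype(np.float32)

# First operand gets a broadcast as the last dim:  [M, K] -> [M, K, 1]
# Second operand gets a broadcast as the first dim: [K, N] -> [1, K, N]
a_bcast = a[:, :, np.newaxis]   # [M, K, 1]
b_bcast = b[np.newaxis, :, :]   # [1, K, N]

# Elementwise multiply broadcasts to [M, K, N]; summing over the
# K axis (axis=1) yields the usual [M, N] matmul result.
mma_like = (a_bcast * b_bcast).sum(axis=1)

assert np.allclose(mma_like, a @ b, atol=1e-5)
```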
csrc/ir/utils.cpp
Outdated
Val* bcast_bias = binary->input(1);
// Bias is casted to fp32 and broadcasted from shape [M,] to [M, 1] in
// biasEpilogue.
NVF_ERROR(
I'm not sure about NVF_CHECK vs NVF_ERROR. But this check and several others down the road seem to be of the same nature as those in verifyMmaOpForEvaluation. Should all of them use the same macro?
I am using NVF_ERROR for all. I will make sure to replace any NVF_CHECK with NVF_ERROR. My reasoning is that this is a case that should not have occurred under our current matmul assumptions.
!build
!build
Val* bias = nullptr;
// Case 2: Matmul + Bias
if (MmaOpUtils::matchMatmulBiasCast(
I am not sure how to address this here, but what worries me is that torch.nn.functional.linear computes x @ A^T + bias. So unless we transpose A, we won't have [M, K, 1] x [1, N, K].
https://pytorch.org/docs/stable/generated/torch.nn.functional.linear.html
We may need to write/use a routine to detect the shape.
I'll address this in the next PR. The assumed shapes are the same as those in matmul, so I was assuming that the transpose would already have been done when needed. We can discuss more today.
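To make the transpose concern concrete: per the linked documentation, torch.nn.functional.linear computes y = x @ weight^T + bias, so the weight arrives as [N, K] and needs a transpose before the [M, K, 1] x [1, K, N] broadcast pattern applies. A NumPy sketch of the documented semantics (illustrative only):

```python
import numpy as np

M, K, N = 4, 8, 3
x = np.random.rand(M, K).astype(np.float32)
weight = np.random.rand(N, K).astype(np.float32)  # linear stores weight as [N, K]
bias = np.random.rand(N).astype(np.float32)

# torch.nn.functional.linear semantics: y = x @ weight.T + bias
linear_out = x @ weight.T + bias

# The broadcast-matmul pattern only matches after transposing weight
# to [K, N], giving operands [M, K, 1] x [1, K, N].
a_bcast = x[:, :, np.newaxis]         # [M, K, 1]
b_bcast = weight.T[np.newaxis, :, :]  # [1, K, N]
bcast_out = (a_bcast * b_bcast).sum(axis=1) + bias

assert np.allclose(linear_out, bcast_out, atol=1e-5)
```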
Adds ATen evaluation for Matmul and Matmul + Bias. Building on PR #1921, when evaluating a castOp, we look back to see if there is a preceding MmaOp and evaluate them together. Issue #1775.
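The evaluation pattern described above (evaluating the MmaOp and the trailing castOp together) can be emulated in NumPy, under the assumption that the MmaOp accumulates in fp32 and the bias epilogue casts and broadcasts the bias as in the quoted comment. This is an illustrative sketch, not the actual nvFuser implementation:

```python
import numpy as np

M, K, N = 4, 8, 3
a = np.random.rand(M, K).astype(np.float16)
b = np.random.rand(K, N).astype(np.float16)
bias = np.random.rand(M).astype(np.float16)

# The MmaOp accumulates in fp32 ...
mma_out = a.astype(np.float32) @ b.astype(np.float32)

# ... the bias is cast to fp32 and broadcast from [M,] to [M, 1] ...
biased = mma_out + bias.astype(np.float32)[:, np.newaxis]

# ... and the trailing castOp brings the result back to fp16.
result = biased.astype(np.float16)

assert result.dtype == np.float16
```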