Avoid gtCloneExpr in HW helper-intrinsics by fiigii · Pull Request #16766 · dotnet/coreclr

fiigii · 2018-03-05T22:38:33Z

This PR changes SSE_Shuffle to internally accept 2 or 3 operands to avoid gtCloneExpr in helper-intrinsics.

Similar techniques could solve the gtCloneExpr problems that we are discussing in #16758.

@CarolEidt @AndyAyersMS @tannergooding @4creators @mikedn

fiigii · 2018-03-05T22:50:26Z

No merge; this is just for discussion.

tannergooding · 2018-03-05T22:55:09Z

            op1     = impPopStack().val;
-            retNode = gtNewSimdHWIntrinsicNode(TYP_SIMD16, op1, gtCloneExpr(op1), gtNewIconNode(0), NI_SSE_Shuffle,
-                                               TYP_FLOAT, simdSize);
+            retNode = gtNewSimdHWIntrinsicNode(TYP_SIMD16, op1, gtNewIconNode(0), NI_SSE_Shuffle, TYP_FLOAT, simdSize);


Wouldn't this require more changes in codegen to ensure that the appropriate overload of emitIns_SIMD is called?

Probably not; I saw that the Vector<T> code always uses shuffle in its 2-operand form. Let me investigate further.
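To make the 2-operand versus 3-operand distinction concrete, here is a minimal scalar sketch of how SHUFPS selects lanes. It is not JIT code: `Vec4` and `shufpsModel` are hypothetical names introduced only for illustration, and the 2-operand overload is an assumption about what "using the single operand as both sources" would mean.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using Vec4 = std::array<float, 4>;

// Scalar model of SHUFPS: the two low result lanes are picked from the first
// source and the two high result lanes from the second source, with two bits
// of the 8-bit control selecting each lane.
Vec4 shufpsModel(const Vec4& src1, const Vec4& src2, std::uint8_t imm8)
{
    return Vec4{
        src1[imm8 & 3],
        src1[(imm8 >> 2) & 3],
        src2[(imm8 >> 4) & 3],
        src2[(imm8 >> 6) & 3],
    };
}

// Hypothetical 2-operand form: the same vector serves as both sources, so the
// caller does not have to supply (or clone) a second copy of the operand.
Vec4 shufpsModel(const Vec4& src, std::uint8_t imm8)
{
    return shufpsModel(src, src, imm8);
}
```

With a control of 0, as in the diff above, every result lane is lane 0 of the input, so the 2-operand and 3-operand forms produce the same value when both sources are the same vector.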

It is not necessary to add a new internal overload for SSE_Shuffle; see comment: #16758 (comment)

You can substitute:

retNode = gtNewSimdHWIntrinsicNode(TYP_SIMD16, op1, gtNewIconNode(0), NI_SSE2_Shuffle, TYP_INT, simdSize);

I think it's safe, as I do not know of any processors from the last 10 years that support SSE but not SSE2.
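For comparison, the SSE2 integer shuffle (PSHUFD) takes a single source vector plus a control byte, which is why it needs no second copy of the operand. The following is a scalar sketch of that lane selection only; `Lanes` and `pshufdModel` are hypothetical names and this is not the JIT's implementation.

```cpp
#include <array>
#include <cassert>
#include <cstdint>

using Lanes = std::array<std::uint32_t, 4>;

// Scalar model of PSHUFD: each 32-bit result lane i is copied from the source
// lane named by bits [2i+1 : 2i] of the 8-bit control.
Lanes pshufdModel(const Lanes& src, std::uint8_t imm8)
{
    Lanes result{};
    for (int i = 0; i < 4; ++i)
    {
        result[i] = src[(imm8 >> (2 * i)) & 3];
    }
    return result;
}
```

A control of 0 copies lane 0 into all four lanes. Because the model moves raw 32-bit values, the same selection applies to float data reinterpreted as integers, which is what passing TYP_INT for a TYP_SIMD16 value amounts to in the suggestion above.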

Otherwise, what I have seen in #16758 with Compiler::fgMakeMultiUse works really well with HW intrinsics.

> Otherwise, what I have seen in #16758 with Compiler::fgMakeMultiUse works really well with HW intrinsics.

Why not adopt a simpler solution?

I would just use SSE2_Shuffle; we already have it.

I just found that this solution does not work because VEX-encoding always duplicates dst rather than src for this instruction.

Avoid CloneExpr in HW helper-intrinsics

6eed9c0

tannergooding added the "* NO MERGE *" label (The PR is not ready for merge yet; see discussion for detailed reasons) on Mar 5, 2018

tannergooding reviewed Mar 5, 2018

fiigii closed this Mar 5, 2018

fiigii deleted the cloneexp branch March 5, 2018 23:46

fiigii wants to merge 1 commit into dotnet:master from fiigii:cloneexp


4 participants
