Fixing up the Sse41.Insert float HWIntrinsics by tannergooding · Pull Request #18735 · dotnet/coreclr

tannergooding · 2018-06-30T22:05:21Z

This adds back the Sse41.Insert float tests (temporarily removed due to the API changes made in Improve Intel hardware intrinsic APIs #17637)
This fixes the Sse41.Insert float implementation to support immediate values greater than 0x3F

tannergooding · 2018-06-30T22:05:33Z

tannergooding · 2018-06-30T22:06:40Z

    ("SimpleUnOpTest.template",      new Dictionary<string, string> { ["Isa"] = "Sse41", ["LoadIsa"] = "Sse",  ["Method"] = "Floor",                         ["RetVectorType"] = "Vector128", ["RetBaseType"] = "Single",  ["Op1VectorType"] ="Vector128", ["Op1BaseType"] = "Single",                                                                                                                                                                   ["LargestVectorSize"] = "16", ["NextValueOp1"] = "(float)(random.NextDouble())",                                                                                                                                                                ["ValidateFirstResult"] = "BitConverter.SingleToInt32Bits(result[0]) != BitConverter.SingleToInt32Bits(MathF.Floor(firstOp[0]))",                                                                                                                                               ["ValidateRemainingResults"] = "BitConverter.SingleToInt32Bits(result[i]) != BitConverter.SingleToInt32Bits(MathF.Floor(firstOp[i]))"}),
    ("SimpleBinOpTest.template",     new Dictionary<string, string> { ["Isa"] = "Sse41", ["LoadIsa"] = "Sse2", ["Method"] = "FloorScalar",                   ["RetVectorType"] = "Vector128", ["RetBaseType"] = "Double",  ["Op1VectorType"] ="Vector128", ["Op1BaseType"] = "Double", ["Op2VectorType"] = "Vector128", ["Op2BaseType"] = "Double",                                                                                                      ["LargestVectorSize"] = "16", ["NextValueOp1"] = "(double)(random.NextDouble())",                        ["NextValueOp2"] = "(double)(random.NextDouble())",                                                                                    ["ValidateFirstResult"] = "BitConverter.DoubleToInt64Bits(result[0]) != BitConverter.DoubleToInt64Bits(Math.Floor(right[0]))",                                                                                                                                                  ["ValidateRemainingResults"] = "BitConverter.DoubleToInt64Bits(result[i]) != BitConverter.DoubleToInt64Bits(left[i])"}),
    ("SimpleBinOpTest.template",     new Dictionary<string, string> { ["Isa"] = "Sse41", ["LoadIsa"] = "Sse",  ["Method"] = "FloorScalar",                   ["RetVectorType"] = "Vector128", ["RetBaseType"] = "Single",  ["Op1VectorType"] ="Vector128", ["Op1BaseType"] = "Single", ["Op2VectorType"] = "Vector128", ["Op2BaseType"] = "Single",                                                                                                      ["LargestVectorSize"] = "16", ["NextValueOp1"] = "(float)(random.NextDouble())",                         ["NextValueOp2"] = "(float)(random.NextDouble())",                                                                                     ["ValidateFirstResult"] = "BitConverter.SingleToInt32Bits(result[0]) != BitConverter.SingleToInt32Bits(MathF.Floor(right[0]))",                                                                                                                                                 ["ValidateRemainingResults"] = "BitConverter.SingleToInt32Bits(result[i]) != BitConverter.SingleToInt32Bits(left[i])"}),
+    ("InsertVector128Test.template", new Dictionary<string, string> { ["Isa"] = "Sse41", ["LoadIsa"] = "Sse",  ["Method"] = "Insert",                        ["RetVectorType"] = "Vector128", ["RetBaseType"] = "Single",  ["Op1VectorType"] ="Vector128", ["Op1BaseType"] = "Single", ["Op2VectorType"] = "Vector128", ["Op2BaseType"] = "Single",                                                                                     ["Imm"] = "0",   ["LargestVectorSize"] = "16", ["NextValueOp1"] = "(float)(random.NextDouble())",                         ["NextValueOp2"] = "(float)(random.NextDouble())",                                                                                     ["ValidateFirstResult"] = "BitConverter.SingleToInt32Bits(result[0]) != BitConverter.SingleToInt32Bits(right[0])",                                                                                                                                                              ["ValidateRemainingResults"] = "BitConverter.SingleToInt32Bits(result[i]) != BitConverter.SingleToInt32Bits(left[i])"}),


As usual, tests are auto-generated from this template data, it is worthwhile looking at the base template and the data, but probably not reviewing each test individually.

tannergooding · 2018-06-30T22:07:26Z

                        ssize_t ival = op3->AsIntCon()->IconValue();
                        assert((ival >= 0) && (ival <= 255));
-
-                        if ((intrinsicId == NI_SSE41_Insert) && (baseType == TYP_FLOAT))


No longer required since the new API shape allows a vector for the second operand, and therefore makes the upper two bits relevant.

tannergooding · 2018-06-30T22:10:51Z

+
+                        op3 = argList->Current();
+
+                        // The upper two bits of the immediate value are ignored if


This is important, as per the architecture manuals:

When the scalar source is a memory operand the Count_S bits are ignored.

As such, we can support containment when the control mask (op3) is <= 0x3F. Otherwise, we need to ensure that op2 is in register and should not support any containment.

tannergooding · 2018-06-30T22:11:45Z

Some unrelated AVX tests will fail for NoSIMD and NoAVX, these are being handled by: #18734

tannergooding · 2018-07-02T17:04:21Z

@dotnet-bot test Windows_NT x64 Checked jitincompletehwintrinsic
@dotnet-bot test Windows_NT x64 Checked jitx86hwintrinsicnoavx
@dotnet-bot test Windows_NT x64 Checked jitx86hwintrinsicnoavx2
@dotnet-bot test Windows_NT x64 Checked jitx86hwintrinsicnosimd
@dotnet-bot test Windows_NT x64 Checked jitnox86hwintrinsic

@dotnet-bot test Windows_NT x86 Checked jitincompletehwintrinsic
@dotnet-bot test Windows_NT x86 Checked jitx86hwintrinsicnoavx
@dotnet-bot test Windows_NT x86 Checked jitx86hwintrinsicnoavx2
@dotnet-bot test Windows_NT x86 Checked jitx86hwintrinsicnosimd
@dotnet-bot test Windows_NT x86 Checked jitnox86hwintrinsic

@dotnet-bot test Ubuntu x64 Checked jitincompletehwintrinsic
@dotnet-bot test Ubuntu x64 Checked jitx86hwintrinsicnoavx
@dotnet-bot test Ubuntu x64 Checked jitx86hwintrinsicnoavx2
@dotnet-bot test Ubuntu x64 Checked jitx86hwintrinsicnosimd
@dotnet-bot test Ubuntu x64 Checked jitnox86hwintrinsic

tannergooding · 2018-07-02T17:04:27Z

Rebased onto dotnet/master to pick-up the test fixes

CarolEidt

LGTM

fiigii

LGTM

tannergooding commented Jun 30, 2018

View reviewed changes

tannergooding mentioned this pull request Jun 30, 2018

Some test fixes for the x86 HWIntrinsics #18734

Merged

Fixing up the Sse41.Insert float HWIntrinsics

e73c4f8

CarolEidt approved these changes Jul 2, 2018

View reviewed changes

fiigii approved these changes Jul 2, 2018

View reviewed changes

tannergooding merged commit a7167dd into dotnet:master Jul 2, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing up the Sse41.Insert float HWIntrinsics#18735

Fixing up the Sse41.Insert float HWIntrinsics#18735
tannergooding merged 1 commit into
dotnet:masterfrom
tannergooding:hwintrin-sse41-insert

tannergooding commented Jun 30, 2018

Uh oh!

tannergooding commented Jun 30, 2018

Uh oh!

tannergooding Jun 30, 2018

Uh oh!

tannergooding Jun 30, 2018

Uh oh!

tannergooding Jun 30, 2018

Uh oh!

tannergooding commented Jun 30, 2018

Uh oh!

tannergooding commented Jul 2, 2018

Uh oh!

tannergooding commented Jul 2, 2018

Uh oh!

CarolEidt left a comment

Uh oh!

fiigii left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		op3 = argList->Current();

		// The upper two bits of the immediate value are ignored if

Conversation

tannergooding commented Jun 30, 2018

Uh oh!

tannergooding commented Jun 30, 2018

Uh oh!

tannergooding Jun 30, 2018

Choose a reason for hiding this comment

Uh oh!

tannergooding Jun 30, 2018

Choose a reason for hiding this comment

Uh oh!

tannergooding Jun 30, 2018

Choose a reason for hiding this comment

Uh oh!

tannergooding commented Jun 30, 2018

Uh oh!

tannergooding commented Jul 2, 2018

Uh oh!

tannergooding commented Jul 2, 2018

Uh oh!

CarolEidt left a comment

Choose a reason for hiding this comment

Uh oh!

fiigii left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants