Change VEX-encoding selection to avoid AVX-SSE transition penalties by fiigii · Pull Request #15014 · dotnet/coreclr

fiigii · 2017-11-13T22:32:42Z

This PR changes VEX-encoding selection to resolve #14065.

Continue to decouple SIMD support level from instruction set value of AVX/AVX2.
- Separate UseAVX to two flags: UseVEXEncoding (AVX supported) and compiler->getSIMDSupportLevel() == SIMD_AVX2_Supported.
Move the bar of using VEX encoding to AVX.
Move the bar of inserting VZERROUPPER to AVX.
Change codgen of genSIMDScalarMove (constructors of Vector2/3/4).

fiigii · 2017-11-14T00:19:37Z

Tests passed on local Skylake (AVX2) and Ivy Bridge (AVX) machines.

fiigii · 2017-11-14T00:20:03Z

@CarolEidt @BruceForstall PTAL

BruceForstall · 2017-11-14T00:58:35Z

cc @dotnet/jit-contrib

CarolEidt · 2017-11-14T01:15:52Z

Add one more SIMD level SIMD_AVX_Supported.

I believe that we explicitly do not want an additional SIMD level, as we don't want to multiply our test burden.

CarolEidt

I want to be sure that:

We are not generating AVX instructions in ngen/crossgen, and
We aren't generating different code for AVX, aside from the encodings.

CarolEidt · 2017-11-14T01:12:55Z

-
-    // COMPlus_EnableAVX can be used to disable using AVX if available on a target machine.
-    opts.compCanUseAVX = false;
-    if (!jitFlags.IsSet(JitFlags::JIT_FLAG_PREJIT) && jitFlags.IsSet(JitFlags::JIT_FLAG_USE_AVX2))


Where is this being handled now? This was the condition that caused us not to generate AVX code during crossgen, as we can't be assured that the target will be the same.

CarolEidt · 2017-11-14T01:15:16Z

 {
    assert(varTypeIsFloating(baseType));
-    if (compiler->getSIMDSupportLevel() == SIMD_AVX2_Supported)
+    if (compiler->getSIMDSupportLevel() >= SIMD_AVX_Supported)


I don't believe we want to generate different code for the AVX case, to avoid multiplying our test matrix.

But here is special. vmovss should use the sematics of "merge" (vmovss xmm1, xmm1, xmm2) rather than semtanc of "move`` (vmovss xmm1, xmm2, xmm2). Let me try to give a better solution.

fiigii · 2017-11-14T03:32:52Z

I believe that we explicitly do not want an additional SIMD level, as we don't want to multiply our test burden.

@CarolEidt I see, will change.

fiigii · 2017-11-14T03:52:06Z

-                if (configEnableISA(InstructionSet_AVX2))
+                // COMPlus_EnableAVX is also used to control the code generation of
+                // System.Numerics.Vectors and floating-point arithmetics
+                if (configEnableISA(InstructionSet_AVX) && configEnableISA(InstructionSet_AVX2))


Where is this being handled now? This was the condition that caused us not to generate AVX code during crossgen, as we can't be assured that the target will be the same.

@CarolEidt I am using InstructionSet_AVX and InstructionSet_AVX2 instead of UseAVX, which is already guarded by !jitFlags.IsSet(JitFlags::JIT_FLAG_PREJIT).

fiigii · 2017-11-14T18:57:45Z

Update

Remove SIMD_AVX_Supported, and only use canUseVexEncoding() and SIMD_AVX2_Supported.
Cleanup genSIMDScalarMove().

fiigii force-pushed the vexencoding branch 3 times, most recently from 6448a36 to fe06214 Compare November 14, 2017 00:16

fiigii changed the title ~~[WIP] Change VEX-encoding selection to avoid AVX-SSE transition penalties~~ Change VEX-encoding selection to avoid AVX-SSE transition penalties Nov 14, 2017

BruceForstall requested review from BruceForstall and CarolEidt November 14, 2017 00:58

CarolEidt suggested changes Nov 14, 2017

View reviewed changes

fiigii commented Nov 14, 2017

View reviewed changes

fiigii force-pushed the vexencoding branch 3 times, most recently from d49bb84 to 7cd4a89 Compare November 14, 2017 18:33

Change VEX-encoding selection to avoid AVX-SSE transition penalties

746daa1

fiigii force-pushed the vexencoding branch from 7cd4a89 to 746daa1 Compare November 14, 2017 18:54

CarolEidt approved these changes Nov 14, 2017

View reviewed changes

CarolEidt merged commit 9369b27 into dotnet:master Nov 14, 2017

fiigii mentioned this pull request Dec 22, 2017

Implement simple Sse2 hardware instrinsics #15585

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Change VEX-encoding selection to avoid AVX-SSE transition penalties#15014

Change VEX-encoding selection to avoid AVX-SSE transition penalties#15014
CarolEidt merged 1 commit into
dotnet:masterfrom
fiigii:vexencoding

fiigii commented Nov 13, 2017 •

edited

Loading

Uh oh!

fiigii commented Nov 14, 2017

Uh oh!

fiigii commented Nov 14, 2017

Uh oh!

BruceForstall commented Nov 14, 2017

Uh oh!

CarolEidt commented Nov 14, 2017

Uh oh!

CarolEidt left a comment

Uh oh!

CarolEidt Nov 14, 2017

Uh oh!

CarolEidt Nov 14, 2017

Uh oh!

fiigii Nov 14, 2017

Uh oh!

fiigii commented Nov 14, 2017

Uh oh!

fiigii Nov 14, 2017

Uh oh!

fiigii commented Nov 14, 2017 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

fiigii commented Nov 13, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

fiigii commented Nov 14, 2017

Uh oh!

fiigii commented Nov 14, 2017

Uh oh!

BruceForstall commented Nov 14, 2017

Uh oh!

CarolEidt commented Nov 14, 2017

Uh oh!

CarolEidt left a comment

Choose a reason for hiding this comment

Uh oh!

CarolEidt Nov 14, 2017

Choose a reason for hiding this comment

Uh oh!

CarolEidt Nov 14, 2017

Choose a reason for hiding this comment

Uh oh!

fiigii Nov 14, 2017

Choose a reason for hiding this comment

Uh oh!

fiigii commented Nov 14, 2017

Uh oh!

fiigii Nov 14, 2017

Choose a reason for hiding this comment

Uh oh!

fiigii commented Nov 14, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

fiigii commented Nov 13, 2017 •

edited

Loading

fiigii commented Nov 14, 2017 •

edited

Loading