JIT: Handle edge cases around BlendVariableMask rewrite by saucecontrol · Pull Request #126062 · dotnet/runtime

saucecontrol · 2026-03-24T23:11:52Z

This deals with some edge cases related to BlendVariable being 'upgraded' to BlendVariableMask on import and then possibly 'downgraded' to its original form in rationalize.

First, we were improperly rewriting some blends back to BlendVariable when the mask granularity was incompatible with the pblendvb instruction.

Second, there is logic in place that checks whether the blend could be used as an EVEX embedded mask and rewrites back to BlendVariable if not. However, it misses cases where the mask is created from a vector anyway, and creating the mask just to embed it is a deoptimization.

Copilot

Pull request overview

This PR updates JIT rationalization for xarch NI_AVX512_BlendVariableMask to avoid keeping the mask-form blend when the mask operand originates from a vector-to-mask conversion, preventing a deoptimization where a mask is created only to be embedded.

Changes:

Extend RewriteHWIntrinsicBlendv to detect when the blend mask is produced via NI_AVX512_ConvertVectorToMask.
Avoid the “keep embedded mask” early-return in that scenario so the blend can be rewritten back to the non-mask form.

Comments suppressed due to low confidence (1)

src/coreclr/jit/rationalize.cpp:685

op3 is now a local GenTree*, but it is later passed by address to RewriteHWIntrinsicToNonMask(&op3, ...). RewriteHWIntrinsicToNonMask expects use to be the actual operand edge so it can replace/remove nodes (e.g., it removes NI_AVX512_ConvertVectorToMask and updates the parent via ReplaceOperand). Passing a local pointer means node->Op(3) will not be updated, leaving the blend node still pointing at the removed intrinsic (dangling operand / miscompile). Use GenTree*& op3 = node->Op(3); (or otherwise pass &node->Op(3) / an operand reference) when calling RewriteHWIntrinsicToNonMask.

    GenTree* op2 = node->Op(2);
    GenTree* op3 = node->Op(3);

    // We're in the post-order visit and are traversing in execution order, so
    // everything between op2 and node will have already been rewritten to LIR
    // form and doing the IsInvariantInRange check is safe. This allows us to
    // catch cases where something is embedded masking compatible but where we
    // could never actually contain it and so we want to rewrite it to the non-mask
    // variant
    SideEffectSet scratchSideEffects;

    if (scratchSideEffects.IsLirInvariantInRange(m_compiler, op2, node))
    {
        unsigned  tgtMaskSize     = simdSize / genTypeSize(simdBaseType);
        var_types tgtSimdBaseType = TYP_UNDEF;

        if (op2->isEmbeddedMaskingCompatible(m_compiler, tgtMaskSize, tgtSimdBaseType))
        {
            // Make sure we had a mask to begin with. We don't want to create a mask
            // solely for the purpose of embedding it.

            if (!op3->OperIsHWIntrinsic() ||
                (op3->AsHWIntrinsic()->GetHWIntrinsicId() != NI_AVX512_ConvertVectorToMask))
            {
                // We are going to utilize the embedded mask, so we don't need to rewrite. However,
                // we want to fixup the simdBaseType here since it simplifies lowering and allows
                // both embedded broadcast and the mask to be live simultaneously.

                if (tgtSimdBaseType != TYP_UNDEF)
                {
                    op2->AsHWIntrinsic()->SetSimdBaseType(tgtSimdBaseType);
                }
                return;
            }
        }
    }

    if (!ShouldRewriteToNonMaskHWIntrinsic(op3))
    {
        return;
    }

    parents.Push(op3);
    RewriteHWIntrinsicToNonMask(&op3, parents);
    (void)parents.Pop();

saucecontrol · 2026-03-25T05:45:25Z

cc @dotnet/jit-contrib

SPMI doesn't show any diffs, but this does fix up my motivating case. Something like:

static Vector128<float> AddToNegative(Vector128<float> v1, Vector128<float> v2)
    => Sse41.BlendVariable(v1, v1 + v2, v1);

    vmovups  xmm0, xmmword ptr [rdx]
-   vpmovd2m k1, xmm0
-   vaddps   xmm0 {k1}, xmm0, xmmword ptr [r8]
+   vaddps   xmm1, xmm0, xmmword ptr [r8]
+   vblendvps xmm0, xmm0, xmm1, xmm0
    vmovups  xmmword ptr [rcx], xmm0
    mov      rax, rcx
    ret      
-; Total bytes of code: 24
+; Total bytes of code: 23

JulieLeeMSFT · 2026-04-20T14:51:18Z

@tannergooding, @kg PTAL.

Copilot

Pull request overview

Copilot reviewed 1 out of 1 changed files in this pull request and generated 2 comments.

saucecontrol · 2026-04-23T04:26:19Z

Few diffs now with the bug fix included.

Copilot

Pull request overview

Copilot reviewed 3 out of 3 changed files in this pull request and generated 2 comments.

tannergooding · 2026-04-23T20:34:50Z

CC. @dotnet/jit-contrib, @EgorBo, @kg this should be ready for secondary review. Fixes an issue with the BlendVariableMask -> BlendVariable rewriting and ensures its used in cases where it would be more optimal.

Copilot AI review requested due to automatic review settings March 24, 2026 23:11

github-actions Bot added the area-CodeGen-coreclr CLR JIT compiler in src/coreclr/src/jit and related components such as SuperPMI label Mar 24, 2026

dotnet-policy-service Bot added the community-contribution Indicates that the PR has been added by a community member label Mar 24, 2026

Copilot started reviewing on behalf of saucecontrol March 24, 2026 23:12 View session

Copilot AI reviewed Mar 24, 2026

View reviewed changes

rewrite BlendVariableMask when mask is created from vector

f8fae7f

saucecontrol force-pushed the less-mask branch from ab8d220 to f8fae7f Compare March 25, 2026 02:37

This was referenced Mar 25, 2026

The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#3008

Open

[android-arm64] The Operation will be canceled. The next steps may not contain expected logs. dotnet/dnceng#6408

Open

JulieLeeMSFT assigned saucecontrol Apr 20, 2026

JulieLeeMSFT requested a review from tannergooding April 20, 2026 14:50

JulieLeeMSFT requested a review from kg April 20, 2026 14:51

Merge branch 'main' into less-mask

a934fad

Copilot AI review requested due to automatic review settings April 20, 2026 14:51

Copilot started reviewing on behalf of tannergooding April 20, 2026 14:52 View session

JulieLeeMSFT assigned kg Apr 20, 2026

Copilot AI reviewed Apr 20, 2026

View reviewed changes

Comment thread src/coreclr/jit/rationalize.cpp Outdated

Comment thread src/coreclr/jit/rationalize.cpp Outdated

tannergooding reviewed Apr 20, 2026

View reviewed changes

Comment thread src/coreclr/jit/rationalize.cpp Outdated

JulieLeeMSFT added the needs-author-action An issue or pull request that requires more info or actions from the author. label Apr 20, 2026

This was referenced Apr 20, 2026

[wasm] WBT SatelliteAssembliesTests.CheckThatSatelliteAssembliesAreNotAOTed failing #90458

Open

browser-wasm linux Release LibraryTests queues timing out #117974

Open

System.Net.NameResolution.Tests DNS failures: Name or service not known #126641

Open

fix 127260

f7c1788

dotnet-policy-service Bot removed the needs-author-action An issue or pull request that requires more info or actions from the author. label Apr 23, 2026

saucecontrol changed the title ~~JIT: Rewrite BlendVariableMask when mask is created from vector~~ JIT: Handle edge cases around BlendVariableMask rewrite Apr 23, 2026

This was referenced Apr 23, 2026

XHarness package install failure on iOS due to devicectl NSPOSIXErrorDomain error 49 #123796

Open

Build error: ilc exited with code 57005 #124976

Open

build-analysis Bot mentioned this pull request Apr 23, 2026

ProcessSafeHandle_WaitForExitOrKillOnCancellationAsync_KillsOnCancellation failuring in CI #127287

Closed

Copilot AI mentioned this pull request Apr 23, 2026

Fix race condition: set _canceled before SignalCore in ProcessWaitState #127312

Merged

tannergooding reviewed Apr 23, 2026

View reviewed changes

Comment thread src/coreclr/jit/rationalize.cpp Outdated

allow re-typing the blend

17130ab

Copilot AI review requested due to automatic review settings April 23, 2026 20:15

Copilot started reviewing on behalf of saucecontrol April 23, 2026 20:16 View session

early return

ae2c431

Copilot AI reviewed Apr 23, 2026

View reviewed changes

Comment thread src/tests/JIT/Regression/JitBlue/Runtime_127260/Runtime_127260.cs Outdated

Comment thread src/tests/JIT/Regression/JitBlue/Runtime_127260/Runtime_127260.cs

formatting

902ef7b

tannergooding approved these changes Apr 23, 2026

View reviewed changes

EgorBo approved these changes Apr 24, 2026

View reviewed changes

tannergooding merged commit a75526a into dotnet:main Apr 24, 2026
137 of 141 checks passed

saucecontrol deleted the less-mask branch April 24, 2026 17:00

dotnet-maestro Bot mentioned this pull request Apr 25, 2026

[main] Source code updates from dotnet/runtime dotnet/dotnet#6272

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

JIT: Handle edge cases around BlendVariableMask rewrite#126062

JIT: Handle edge cases around BlendVariableMask rewrite#126062
tannergooding merged 6 commits intodotnet:mainfrom
saucecontrol:less-mask

saucecontrol commented Mar 24, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

saucecontrol commented Mar 25, 2026

Uh oh!

JulieLeeMSFT commented Apr 20, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

saucecontrol commented Apr 23, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

tannergooding commented Apr 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Conversation

saucecontrol commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

saucecontrol commented Mar 25, 2026

Uh oh!

JulieLeeMSFT commented Apr 20, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

Uh oh!

saucecontrol commented Apr 23, 2026

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Uh oh!

Uh oh!

tannergooding commented Apr 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

saucecontrol commented Mar 24, 2026 •

edited

Loading