SocketAsyncEngine.Unix: improve performance of context lookup by tmds · Pull Request #36358 · dotnet/corefx

tmds · 2019-03-26T09:54:19Z

No description provided.

tmds · 2019-03-26T09:55:52Z

@davidfowl @sebastienros can you benchmark this to see if it matters in scenarios you care about?

tmds · 2019-03-26T11:06:17Z

Linux x64_Release has an infrastructure-related failure:

/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error : ArgumentException: Provided Job List Uri is not accessible [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :    at Microsoft.DotNet.Helix.Client.HelixApi.HandleFailedRequest(RestApiException ex) in /_/src/Microsoft.DotNet.Helix/Client/CSharp/HelixApi.cs:line 29 [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :    at Microsoft.DotNet.Helix.Client.Job.NewInternalAsync(JobCreationRequest body, CancellationToken cancellationToken) in /_/src/Microsoft.DotNet.Helix/Client/CSharp/generated-code/Job.cs:line 259 [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :    at Microsoft.DotNet.Helix.Client.Job.NewAsync(JobCreationRequest body, CancellationToken cancellationToken) in /_/src/Microsoft.DotNet.Helix/Client/CSharp/generated-code/Job.cs:line 190 [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :    at Microsoft.DotNet.Helix.Client.HelixApi.RetryAsync[T](Func`1 function, Action`1 logRetry, Func`2 isRetryable) [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :    at Microsoft.DotNet.Helix.Client.JobDefinition.SendAsync(Action`1 log) in /_/src/Microsoft.DotNet.Helix/JobSender/JobDefinition.cs:line 206 [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :    at Microsoft.DotNet.Helix.Sdk.SendHelixJob.ExecuteCore() in /_/src/Microsoft.DotNet.Helix/Sdk/SendHelixJob.cs:line 234 [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :    at Microsoft.DotNet.Helix.Sdk.HelixTask.Execute() in /_/src/Microsoft.DotNet.Helix/Sdk/HelixTask.cs:line 43 [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error : RestApiException: The response contained an invalid status code 400 Bad Request [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :  [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error : Body: {"Message":"Provided Job List Uri is not accessible"} [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :  [/__w/1/s/eng/sendtohelix.proj]
/home/vsts_azpcontainer/.nuget/packages/microsoft.dotnet.helix.sdk/2.0.0-beta.19171.6/tools/Microsoft.DotNet.Helix.Sdk.MonoQueue.targets(51,5): error :  [/__w/1/s/eng/sendtohelix.proj]

@dotnet-bot test corefx-ci (Linux x64_Release) please

tmds · 2019-03-26T13:35:10Z

/azp run corefx-ci (Linux x64_Release)

tmds · 2019-03-26T13:45:25Z

/azp run corefx-ci

stephentoub · 2019-03-26T14:08:45Z

            try
            {
                bool shutdown = false;
+                SocketAsyncContext[] contexts = new SocketAsyncContext[EventBufferCount];


Could we make _handleToContextMap a ConcurrentDictionary<IntPtr, SocketAsyncContext>? That would result in an allocation on every add, but adds are rare-ish, only when new sockets are added (right?), in which case there's already other allocation happening (e.g. the socket itself and all associated state), and we could avoid the lock on reads entirely, plus avoid this array allocation per event loop, and avoid needing to iterate through the events twice.

I'll make a branch that implements this and we can benchmark both.

This branch uses ConcurrentDictionary: tmds@9cc7527. @stephentoub, you can add some review comments on that commit if you want.

Thanks. I prefer the ConcurrentDictionary version, but we should see what perf looks like for both.
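
To make the two options in this thread concrete, here is a minimal sketch of a handle-to-context map in both styles. It is not the code from the PR: `FakeContext`, `LockedContextMap`, and `ConcurrentContextMap` are hypothetical names for illustration, and `FakeContext` merely stands in for `SocketAsyncContext`.

```csharp
using System;
using System.Collections.Concurrent;
using System.Collections.Generic;

// Hypothetical stand-in for SocketAsyncContext (illustration only).
public sealed class FakeContext
{
    public FakeContext(string name) { Name = name; }
    public string Name { get; }
}

// Variant A: a plain Dictionary guarded by a lock. Every lookup takes the lock.
public sealed class LockedContextMap
{
    private readonly object _gate = new object();
    private readonly Dictionary<IntPtr, FakeContext> _map = new Dictionary<IntPtr, FakeContext>();

    public bool TryAdd(IntPtr handle, FakeContext context)
    {
        lock (_gate)
        {
            if (_map.ContainsKey(handle)) return false;
            _map.Add(handle, context);
            return true;
        }
    }

    public bool TryGet(IntPtr handle, out FakeContext context)
    {
        lock (_gate) { return _map.TryGetValue(handle, out context); }
    }

    public bool TryRemove(IntPtr handle)
    {
        lock (_gate) { return _map.Remove(handle); }
    }
}

// Variant B: ConcurrentDictionary. Lookups take no lock; adds allocate internally.
public sealed class ConcurrentContextMap
{
    private readonly ConcurrentDictionary<IntPtr, FakeContext> _map =
        new ConcurrentDictionary<IntPtr, FakeContext>();

    public bool TryAdd(IntPtr handle, FakeContext context) => _map.TryAdd(handle, context);

    public bool TryGet(IntPtr handle, out FakeContext context) => _map.TryGetValue(handle, out context);

    public bool TryRemove(IntPtr handle) => _map.TryRemove(handle, out _);
}

public static class Program
{
    public static void Main()
    {
        var handle = new IntPtr(42); // hypothetical handle value
        var locked = new LockedContextMap();
        var concurrent = new ConcurrentContextMap();

        locked.TryAdd(handle, new FakeContext("locked"));
        concurrent.TryAdd(handle, new FakeContext("concurrent"));

        if (locked.TryGet(handle, out FakeContext a)) Console.WriteLine(a.Name);
        if (concurrent.TryGet(handle, out FakeContext b)) Console.WriteLine(b.Name);
    }
}
```

Both variants expose the same three operations, so the choice between them only affects what a lookup costs on the event loop and what an add costs when a socket is registered.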

sebastienros · 2019-03-26T16:38:13Z

@tmds Would you mind sharing the dlls with and without the changes you want to benchmark? If you send me an email or tweet I can create a shared folder for you to drop the files in.

stephentoub · 2019-03-28T13:50:09Z

@sebastienros, @tmds, were you guys able to get any benchmarking done? Anything I can help with?

tmds · 2019-03-28T13:58:44Z

> @sebastienros, @tmds, were you guys able to get any benchmarking done? Anything I can help with?

I sent dlls to Sébastien. I didn't get benchmark results yet.

tmds · 2019-03-28T19:41:37Z

I got these benchmark results from @sebastienros:

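Description |     RPS | CPU (%) | Memory (MB) | Avg. Latency (ms) | Startup (ms) | First Request (ms) | Latency (ms) | Ratio |
----------- |--------:|--------:|------------:|------------------:|-------------:|-------------------:|-------------:|------:|
   Baseline | 471,819 |      98 |         174 |              0.84 |          235 |              80.06 |         0.55 |  1.00 |
     Lookup | 473,732 |      98 |         174 |              0.84 |          231 |              81.65 |         0.75 |  1.00 |
 Concurrent | 472,634 |      98 |         175 |              0.89 |          230 |              79.38 |         0.65 |  1.00 |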

That is: a 0.17% increase with the ConcurrentDictionary, and a 0.4% increase with Dictionary + lock.

[This benchmark is: Plaintext non-pipelined on Linux, with the latest runtime, aspnet and sdk. Each run was done 5 times; the highest and lowest results of each run were excluded and the 3 remaining ones averaged.]
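
For readers who want to reproduce the arithmetic, here is a small sketch of the two calculations involved: dropping the highest and lowest of five samples before averaging, and expressing an RPS difference as a percentage of the baseline. `BenchmarkStats` and its methods are hypothetical helpers, and the five per-run samples are made-up placeholders, since the thread only reports the final averages.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

public static class BenchmarkStats
{
    // Drops the single highest and single lowest sample, then averages the rest.
    public static double TrimmedMean(IReadOnlyList<double> samples)
    {
        if (samples.Count < 3)
            throw new ArgumentException("Need at least 3 samples.", nameof(samples));

        return samples.OrderBy(s => s)
                      .Skip(1)
                      .Take(samples.Count - 2)
                      .Average();
    }

    // Relative change of candidate versus baseline, in percent.
    public static double RelativeChangePercent(double baseline, double candidate)
    {
        return (candidate - baseline) / baseline * 100.0;
    }
}

public static class Program
{
    public static void Main()
    {
        // Placeholder samples (not from the thread): five runs of one configuration.
        var runs = new double[] { 470000, 471000, 472000, 473000, 480000 };
        Console.WriteLine(BenchmarkStats.TrimmedMean(runs));

        // Averaged RPS values taken from the table above.
        const double baseline = 471819;
        const double lookup = 473732;      // Dictionary + lock
        const double concurrent = 472634;  // ConcurrentDictionary

        Console.WriteLine(BenchmarkStats.RelativeChangePercent(baseline, lookup).ToString("F2"));
        Console.WriteLine(BenchmarkStats.RelativeChangePercent(baseline, concurrent).ToString("F2"));
    }
}
```

Differences this small are easily within run-to-run noise, which is why the discussion turns to access patterns rather than the raw numbers.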

stephentoub · 2019-03-28T19:44:19Z

> That is: a 0.17% increase with the ConcurrentDictionary, and a 0.4% increase with Dictionary + lock.

Thanks. How representative of access patterns do we think this test is? If it's representative, the dictionary+lock seems fine. But if we expect there may be other access patterns, I'd be tempted to suggest the concurrent dictionary route, since it won't ever have contention on reads, whereas the dictionary+lock does, and with a global lock.

tmds · 2019-04-05T08:04:13Z

With these small gains, we can just go for the simpler implementation. I'm changing this to the ConcurrentDictionary version.

tmds · 2019-04-08T12:02:42Z

Changed to the ConcurrentDictionary implementation.

tmds · 2019-04-09T02:32:13Z

@stephentoub probably this is ok to merge?

tmds · 2019-04-11T07:40:56Z

@wfurt @davidsh if this looks good to you, can you merge this please?

benaadams · 2019-04-16T14:21:57Z

Regular Dictionary with IntPtr key should be much faster than a ConcurrentDictionary, due to the optimizations it has around struct keys; then if the lock is changed from a static lock to an instance lock; it should generally be uncontended on the receive? (unlike the static lock)

stephentoub · 2019-04-16T14:28:35Z

> Regular Dictionary with IntPtr key should be much faster than a ConcurrentDictionary, due to the optimizations it has around struct keys; then if the lock is changed from a static lock to an instance lock; it should generally be uncontended on the receive

Doesn't matter if there's contention; the lock still adds non-trivial overheads...

using BenchmarkDotNet.Attributes;
using BenchmarkDotNet.Running;
using System;
using System.Collections.Generic;
using System.Collections.Concurrent;

[InProcess]
[MemoryDiagnoser]
public class Test
{
    public static void Main()
    {
        BenchmarkRunner.Run<Test>();
    }

    public Test()
    {
        for (int i = 0; i < 1000; i++)
        {
            _cd.TryAdd((IntPtr)i, new object());
            _d.Add((IntPtr)i, new object());
        }
    }

    private ConcurrentDictionary<IntPtr, object> _cd = new ConcurrentDictionary<IntPtr, object>();
    private Dictionary<IntPtr, object> _d = new Dictionary<IntPtr, object>();

    [Benchmark]
    public bool Concurrent()
    {
        bool result = true;
        for (int i = 0; i < 1000; i++) result &= _cd.TryGetValue((IntPtr)i, out _);
        return result;
    }

    [Benchmark]
    public bool Locked()
    {
        bool result = true;
        for (int i = 0; i < 1000; i++) lock (_d) result &= _d.TryGetValue((IntPtr)i, out _);
        return result;
    }
}

     Method |     Mean |     Error |    StdDev | Gen 0/1k Op | Gen 1/1k Op | Gen 2/1k Op | Allocated Memory/Op |
----------- |---------:|----------:|----------:|------------:|------------:|------------:|--------------------:|
 Concurrent | 10.75 us | 0.2015 us | 0.1885 us |           - |           - |           - |                   - |
     Locked | 24.87 us | 0.4959 us | 0.4870 us |           - |           - |           - |                   - |

stephentoub reviewed Mar 26, 2019

Comment thread src/System.Net.Sockets/src/System/Net/Sockets/SocketAsyncEngine.Unix.cs Outdated

stephentoub reviewed Mar 26, 2019

Comment thread src/System.Net.Sockets/src/System/Net/Sockets/SocketAsyncEngine.Unix.cs

stephentoub reviewed Mar 26, 2019

davidsh added os-linux Linux OS (any supported distro) area-System.Net.Sockets labels Mar 26, 2019

davidsh added this to the 3.0 milestone Mar 26, 2019

ericstj mentioned this pull request Mar 26, 2019

Helix submission failing with ArgumentException: Provided Job List Uri is not accessible dotnet/arcade#2341

Closed

SocketAsyncEngine.Unix: use ConcurrentDictionary to get rid of lock for context lookup (9cc7527)

karelz assigned tmds, davidsh and wfurt Apr 1, 2019

tmds force-pushed the socket_context_lookup branch from 36b7c23 to 9cc7527 Compare April 8, 2019 12:01

tmds mentioned this pull request Apr 8, 2019

SocketAsyncEngine.Unix: remove multiple engines #36693

Closed

stephentoub approved these changes Apr 9, 2019

stephentoub merged commit 1c881ce into dotnet:master Apr 11, 2019

benaadams mentioned this pull request Apr 15, 2019

Big fall in Json, MVC plaintext, MVC Json dotnet/aspnetcore#9388

Closed

picenka21 pushed a commit to picenka21/runtime that referenced this pull request Feb 18, 2022

SocketAsyncEngine.Unix: use ConcurrentDictionary to get rid of lock for context lookup (dotnet/corefx#36358) (2478379) Commit migrated from dotnet/corefx@1c881ce

6 participants