
[Linq] Lower LargeArrayBuilder's ResizeLimit #14738

Merged
VSadov merged 1 commit into dotnet:master from jamesqo:smaller-resize-limit on Jan 26, 2017

Conversation

@jamesqo (Contributor) commented Dec 28, 2016

Background: LargeArrayBuilder has a "resize limit" after which it stops resizing its buffer and switches to using chunked arrays. I initially chose this limit to be 32 elements, because I didn't want it to be too small in case sizeof(T) was small, but also didn't want it to be too large in case sizeof(T) was large.

Description: Since writing #14020, I've come to realize that the 90% use case for Linq is with business objects / classes, and not with low-level stuff like bytes. Since reference types are 4/8 bytes wide, it makes sense to optimize for T being wider. This PR lowers the resize limit of LargeArrayBuilder from 32 -> 8 elements.
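
For readers unfamiliar with the builder, here is a minimal sketch of the resize-limit idea. This is not the actual LargeArrayBuilder source; the starting capacity, the chunk-growth policy, and all type/member names are illustrative assumptions only.

```csharp
// Minimal sketch of the resize-limit idea; NOT the real LargeArrayBuilder.
// StartingCapacity, the chunk-growth policy, and all names here are illustrative.
using System;
using System.Collections.Generic;

internal sealed class SimpleLargeArrayBuilder<T>
{
    private const int StartingCapacity = 4;
    private const int ResizeLimit = 8; // this PR lowers the real limit from 32 to 8

    private T[] _first = new T[StartingCapacity];          // single buffer, grown by doubling until ResizeLimit
    private readonly List<T[]> _chunks = new List<T[]>();  // completed buffers once the limit is reached
    private T[] _current;                                   // buffer currently being written to
    private int _index;                                     // next free slot in _current
    private int _count;                                     // total items added

    public SimpleLargeArrayBuilder() { _current = _first; }

    public void Add(T item)
    {
        if (_index == _current.Length)
        {
            if (_chunks.Count == 0 && _current.Length < ResizeLimit)
            {
                // Below the limit: resize (copy) the single buffer, discarding the old one.
                Array.Resize(ref _current, _current.Length * 2);
                _first = _current;
            }
            else
            {
                // At or past the limit: keep the full buffer as a chunk and start a new,
                // larger one. No copying of previously added elements happens here.
                _chunks.Add(_current);
                _current = new T[Math.Min(_count, 8192)];
                _index = 0;
            }
        }

        _current[_index++] = item;
        _count++;
    }

    public T[] ToArray()
    {
        // Fast path: the items exactly fill the single buffer, so it can be returned directly.
        // (Sizes that previously hit this path, such as 16 and 32, lose it with the lower limit.)
        if (_chunks.Count == 0 && _count == _first.Length)
            return _first;

        // Otherwise copy every chunk plus the partially filled current buffer into a fresh array.
        var result = new T[_count];
        int offset = 0;
        foreach (T[] chunk in _chunks)
        {
            Array.Copy(chunk, 0, result, offset, chunk.Length);
            offset += chunk.Length;
        }
        Array.Copy(_current, 0, result, offset, _index);
        return result;
    }
}
```

In this simplified picture, a limit of 32 keeps every size up to 32 on the single resizable buffer, while lowering it to 8 caps the discarded intermediate buffers at 8 elements, at the cost of a final copy for sizes like 16 and 32 that previously matched the buffer exactly.
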

Performance improvements: see the gist. For reference types, there is a decrease of ~5 GCs for sizes 9-15, and ~10 GCs for sizes 17-31 and 33+. Sizes 16 and 32 regress, which is expected, since at those sizes the array is now chunked and the builder can't return it directly.

cc @stephentoub, @JonHanna, @VSadov

jamesqo changed the title from "Lower LargeArrayBuilder's ResizeLimit" to "[Linq] Lower LargeArrayBuilder's ResizeLimit" on Dec 28, 2016
@VSadov (Member) commented Dec 28, 2016

I wonder if the number of GCs is a meaningful metric of performance.

It seems that the overall GC count would depend on the size of the machine, the GC implementation/mode, or its learning heuristics. Some collections could be cheaper than others. Bigger is not necessarily worse.

CPU time or bytes allocated would be more meaningful.

@karelz (Member) commented Jan 10, 2017

@jamesqo any comment on the above feedback? (I agree with @VSadov here)
Do you plan to measure allocated bytes instead?

@jamesqo (Contributor, Author) commented Jan 11, 2017

@VSadov, @karelz Sorry for the delayed reply. I will try to measure other stats for this when I have the time. It will be tricky to measure CPU time because there are so many virtual method calls here that they completely blot out the overhead of the extra allocations, even for larger enumerables. As for bytes allocated, the only way I am aware of to do that is this new API, so I will have to adjust my benchmark app later to build against the corefx MyGet feed, which has that API.
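
(The link behind "this new API" was lost in the copy and is left as-is. Purely as a hedged illustration of measuring allocated bytes, the sketch below uses GC.GetAllocatedBytesForCurrentThread(), assuming that API is available in the build under test; the AllocationProbe/Benchmark names and the workload are placeholders, not the PR's actual gist.)

```csharp
// Hedged sketch of measuring allocated bytes around a benchmark body.
// Assumes GC.GetAllocatedBytesForCurrentThread() is available in the build under test;
// AllocationProbe/Benchmark and the workload are placeholders, not the PR's actual gist.
using System;
using System.Linq;

internal static class AllocationProbe
{
    private static void Benchmark()
    {
        // Placeholder workload: the Where breaks the known count, so ToArray cannot
        // preallocate exactly and has to grow a builder instead.
        object[] result = Enumerable.Range(0, 31)
                                    .Where(i => i >= 0)
                                    .Select(i => (object)i)
                                    .ToArray();
        GC.KeepAlive(result);
    }

    private static void Main()
    {
        Benchmark(); // warm-up so one-time (JIT/startup) allocations are excluded

        long before = GC.GetAllocatedBytesForCurrentThread();
        for (int i = 0; i < 10000; i++)
        {
            Benchmark();
        }
        long after = GC.GetAllocatedBytesForCurrentThread();

        Console.WriteLine($"Allocated: {after - before:N0} bytes over 10000 iterations");
    }
}
```

Running such a probe once with the resize limit at 32 and once at 8 would give the same before/after allocated-bytes comparison that is eventually done with PerfView below.
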

@karelz (Member) commented Jan 11, 2017

there are so many virtual method calls here that they completely blot out the overhead of extra allocations, even for larger enumerables

If you don't expect CPU time to show a measurable difference, then focus on a metric which does show it -- allocated bytes in this case.
The key point is that we should have a reasonable metric which shows some improvement on real-world-ish cases. Otherwise we really can't say that the change makes things better, just different :)

As for bytes allocated the only way I am aware of how to do that is this new API

Did you try using PerfView? It has great video tutorials.
You might want to check whether xunit-performance measures allocated bytes / GC frequency by default (I remember we had those plans two years ago, but I am not sure if it is implemented yet). Or maybe BenchmarkDotNet supports it? (disclaimer: I never tried it myself)

cc: @brianrob @vancem @valenis for additional perf guidance on allocated bytes measurements

@vancem commented Jan 11, 2017

@jamesqo, doing an ad-hoc performance analysis with PerfView is very easy.

There are videos at https://channel9.msdn.com/Series/PerfView-Tutorial, and the very first one shows that you are just a couple of clicks away from having the tool. From there, if you run

PerfView collect

while your benchmark runs, it will collect both CPU stacks and samples of GC allocation stacks for every 100K of GC bytes allocated. Thus you can quickly get a read on how much allocation happened over the whole benchmark (or any sub-part of it). Running this with and without your change will show the effect you have had on both allocation and CPU time.

@jamesqo (Contributor, Author) commented Jan 14, 2017

@vancem Thank you for the advice, very useful. I actually have used PerfView before for measuring CPU time, but wasn't aware that it could be used to measure allocated bytes as well.

@VSadov Using PerfView, I have just made a comparison between setting the resize limit to 8 and setting it to 32, with the same gist I posted above.

32 elements: (control screenshot)

8 elements: (experimental screenshot)

There are ~6MB fewer allocated bytes (out of 26MB, so roughly 20%) when changing the limit to 8. Note that this is somewhat offset because we are also allocating additional object[][]s to hold the arrays themselves, but that only adds ~1MB, and the net difference is still ~5MB.

@jamesqo (Contributor, Author) commented Jan 20, 2017

@VSadov Maybe you could merge this when you have time? I have posted perf results.

@OmarTawfik (Contributor) commented
LGTM. @VSadov?

@karelz (Member) commented Jan 25, 2017

@VSadov can we merge? We got 2 reviews ...

@VSadov (Member) commented Jan 26, 2017

LGTM

VSadov merged commit 64bd0a4 into dotnet:master on Jan 26, 2017
jamesqo deleted the smaller-resize-limit branch on January 26, 2017 22:17
karelz modified the milestone: 2.0.0 on Feb 4, 2017
picenka21 pushed a commit to picenka21/runtime that referenced this pull request Feb 18, 2022
[Linq] Lower LargeArrayBuilder's ResizeLimit

Commit migrated from dotnet/corefx@64bd0a4