#2727 Removed unnecessary IsPrime function after expanding table. by SunnyWar · Pull Request #6203 · dotnet/corefx

SunnyWar · 2016-02-18T14:22:51Z

The IsPrime function is only used when the table does not have enough values. This change expands the number of values up to the maximum and removes the, now unneeded, IsPrime function.

Suggest exploring alternative bucket sizes and ways to get the bucket size be deferred till another time.

stephentoub · 2016-02-19T00:33:26Z

-            return (candidate == 2);
-        }
-
+            1674319, 2009191, 2411033, 2893249, 3471899, 4166287, 4999559, 5999471, 7199369, 8639249, 10367101,


What algorithm/formula did you use to expand the table?

I plotted the existing points in Excel. I noticed that, except for the first few points, the rest rose by a factor of 1.2 times. So I wrote a little program to compute them so they would match the same pattern.

take the last prime, multiply it by 1.2

find the next prime >= the number in computed in step 1.

goto 1

This is going to change allocations in certain cases. For example, before your change, adding 8M items to a HashSet<int> would result in the _slots and _buckets arrays ending at 11998949 in length, and after your change they'll be 12440537 in length (~400K elements each, ~4% increase). Whether that's good or bad for the scenarios that hit this, I don't know. I do like that we can simplify the code by expanding out the table, but we should understand the full ramifications of doing so before such a change is made.

weshaggard · 2016-02-19T03:49:02Z

cc @ellismg @vancem

JonHanna · 2016-02-19T15:11:21Z

I tried these values in August, and found that the collections tests were appreciably slower to run, and consistently so, which was enough to make me suspect this wasn't a win.
What have you found in that regard?

vancem · 2016-02-19T16:02:27Z

In general I think this is a good change (because it makes the code simpler). I don't think the issue that we might use slightly different table sizes as Stephen notes is really a problem. The effects of any particular size number is small and will average out, and we should not get too hung up about that.

@JonHanna's data is concerning (but also very suprising, I am very suspicious of some outside effect purturbig the results). Can you describe the data you collected in more detail? First I would expect NO change for any perf tests that used dictionary sizes less then 7MB, is that what you saw? Frankly I would also expect no interesting change event above this since the new algorithm is roughly like the old one and the performance of a dictionary is only very weakly affected by how the table is grown.

So we should investigate if we have true negative data, but in the absence of that, this does not seem like a scary change at all.

SunnyWar · 2016-02-20T01:02:01Z

@JonHanna, I don't see how it's possible you got different results since these values in the table and the way they are used is EXACTLY the same for any hash tables of size 7199369 and below.

JonHanna · 2016-02-20T01:23:32Z

It's perfectly possible that I was just unlucky.

SunnyWar · 2016-02-20T01:45:04Z

@JonHanna Micro-bench tests are indeed tricky.

JonHanna · 2016-02-20T02:18:44Z

Yep, and that wasn't even a micro-bench really. I'm happy if someone says "well I compared the two here, and I don't know what Jon's talking about, they turn out the same" 😄

stephentoub · 2016-02-20T05:38:28Z

Test Innerloop CentOS7.1 Release Build and Test please
Test Innerloop CentOS7.1 Debug Build and Test please

#2727 Removed unnecessary IsPrime function after expanding table.

This reverts commit ddf8ca0, reversing changes made to 0a0ea7f.

* Reuse HashHelpers for BinaryFormatter objectholder hashes * Revert "Merge pull request #6203 from SunnyWar/master" This reverts commit ddf8ca0, reversing changes made to 0a0ea7f. * Change resource string, make HashTable reuse existing HashHelper * Add comment describing hash number growth * Add hash number growth tests for BinaryFormatter & HashSet * Disable tests on x86 because of OOMs

dotnet/corefx#2727 Removed unnecessary IsPrime function after expanding table. Commit migrated from dotnet/corefx@ddf8ca0

…efx#25509) * Reuse HashHelpers for BinaryFormatter objectholder hashes * Revert "Merge pull request dotnet/corefx#6203 from SunnyWar/master" This reverts commit dotnet/corefx@ddf8ca0, reversing changes made to dotnet/corefx@0a0ea7f. * Change resource string, make HashTable reuse existing HashHelper * Add comment describing hash number growth * Add hash number growth tests for BinaryFormatter & HashSet * Disable tests on x86 because of OOMs Commit migrated from dotnet/corefx@b6b5982

#2727 Removed unnecessary IsPrime function after expanding table.

3e17bf8

dnfclas added the cla-already-signed label Feb 18, 2016

stephentoub reviewed Feb 19, 2016
View reviewed changes

stephentoub added a commit that referenced this pull request Feb 22, 2016

Merge pull request #6203 from SunnyWar/master

ddf8ca0

#2727 Removed unnecessary IsPrime function after expanding table.

stephentoub merged commit ddf8ca0 into dotnet:master Feb 22, 2016

stephentoub added the netfx-port-consider label Apr 13, 2016

karelz modified the milestone: 1.0.0-rtm Dec 3, 2016

jkotas mentioned this pull request Nov 26, 2017

Add more pre-computed prime numbers to avoid expensive computation dotnet/coreclr#15225

Closed

ViktorHofer added a commit to ViktorHofer/corefx that referenced this pull request Nov 27, 2017

Revert "Merge pull request dotnet#6203 from SunnyWar/master"

4cd6d21

This reverts commit ddf8ca0, reversing changes made to 0a0ea7f.

ViktorHofer added a commit to ViktorHofer/corefx that referenced this pull request Mar 21, 2018

Revert "Merge pull request dotnet#6203 from SunnyWar/master"

a3d2616

This reverts commit ddf8ca0, reversing changes made to 0a0ea7f.

ViktorHofer mentioned this pull request Jan 31, 2020

Right sizing HashSet ctor is allocating too many elements dotnet/runtime#24261

Closed

picenka21 pushed a commit to picenka21/runtime that referenced this pull request Feb 18, 2022

Merge pull request dotnet/corefx#6203 from SunnyWar/master

a87c0b0

dotnet/corefx#2727 Removed unnecessary IsPrime function after expanding table. Commit migrated from dotnet/corefx@ddf8ca0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#2727 Removed unnecessary IsPrime function after expanding table.#6203

#2727 Removed unnecessary IsPrime function after expanding table.#6203
stephentoub merged 1 commit into
dotnet:masterfrom
SunnyWar:master

SunnyWar commented Feb 18, 2016

Uh oh!

stephentoub Feb 19, 2016

Uh oh!

SunnyWar Feb 19, 2016

Uh oh!

stephentoub Feb 19, 2016

Uh oh!

weshaggard commented Feb 19, 2016

Uh oh!

JonHanna commented Feb 19, 2016

Uh oh!

vancem commented Feb 19, 2016

Uh oh!

SunnyWar commented Feb 20, 2016

Uh oh!

JonHanna commented Feb 20, 2016

Uh oh!

SunnyWar commented Feb 20, 2016

Uh oh!

JonHanna commented Feb 20, 2016

Uh oh!

stephentoub commented Feb 20, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Conversation

SunnyWar commented Feb 18, 2016

Uh oh!

stephentoub Feb 19, 2016

Choose a reason for hiding this comment

Uh oh!

SunnyWar Feb 19, 2016

Choose a reason for hiding this comment

Uh oh!

stephentoub Feb 19, 2016

Choose a reason for hiding this comment

Uh oh!

weshaggard commented Feb 19, 2016

Uh oh!

JonHanna commented Feb 19, 2016

Uh oh!

vancem commented Feb 19, 2016

Uh oh!

SunnyWar commented Feb 20, 2016

Uh oh!

JonHanna commented Feb 20, 2016

Uh oh!

SunnyWar commented Feb 20, 2016

Uh oh!

JonHanna commented Feb 20, 2016

Uh oh!

stephentoub commented Feb 20, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants