Address bugs in BigInteger by ts2do · Pull Request #27280 · dotnet/coreclr

ts2do · 2019-10-18T02:01:56Z

Method Add(ref BigInteger lhs, uint value, ref BigInteger result) would store most of the result blocks into lhs instead of result
Method ShiftLeft(ulong input, uint shift, ref BigInteger output) with a shift argument not evenly divisible by 32 would generally compute the higher blocks incorrectly

Method Add(ref BigInteger lhs, uint value, ref BigInteger result) would store most of the result blocks into lhs instead of result Method ShiftLeft(ulong input, uint shift, ref BigInteger output) with a shift argument exceeding 32 would generally compute the higher blocks incorrectly

AntonLapounov

Too many bugs in this code. For Add we should also set result._length in case carry == 0. For ShiftLeft the first if misses setting output. It should read like this:

if ((input == 0) || (shift == 0))
{
    output.SetUInt64(input);
    return;
}

src/System.Private.CoreLib/shared/System/Number.BigInteger.cs

tannergooding · 2019-10-18T15:34:57Z

Looking at the things being fixed in this PR and the item called out by Anton. These bugs don't cause issues in production because the code code paths/failure cases in question aren't currently hit (and I don't believe ever will be hit).

This type is only meant for (and is designed around) float/double formatting/parsing; so I think the appropriate way to fix these bugs would be to remove the unnecessary logic. That should both "fix" the bugs and would potentially make the code faster for its intended purpose.

For example, public static void Add(ref BigInteger lhs, uint value, ref BigInteger result) is only ever called as public void Add(uint value) => Add(ref this, value, ref this); and public static void ShiftLeft(ulong input, uint shift, ref BigInteger output) is only ever used as ShiftLeft(1, exponent, ref result); (where result = new BigInteger(0)).

tannergooding · 2019-10-18T15:41:31Z

Also, for reference, it looks like some of these bugs existed in the original BigNum.cpp code that this code was ported from: #19999

AntonLapounov · 2019-10-18T16:11:46Z

@tannergooding I would not trust other parts of this code either. For instance, SetUInt32 and SetUInt64 allow creating non-canonical zero values with _length greater than zero. That means IsZero may not work correctly.

tannergooding · 2019-10-18T16:35:46Z

For instance, SetUInt32 and SetUInt64 allow creating non-canonical zero values with _length greater than zero.

Yes, but it should never actually happen given the existing usages and if it did; would just result in a slower computation; not necessarily an incorrect computation. That is, the code as is being used; and as was ported from the cpp code; is still functioning correctly.

There are certainly cases where Debug.Assert could be added and where the code is needlessly complicated (as a given scenario will never be hit). So, I'm just saying it would likely be better to simplify the code as part of fixing those; rather than trying to fix the logic to cover a case that will never be hit.

AntonLapounov · 2019-10-18T17:12:25Z

not necessarily an incorrect computation

Well, Compare is also affected and we use it for control flow. For example:

var x = new BigInteger();
x.SetUInt64(0);
var y = new BigInteger(0);

// Outputs 1
Console.WriteLine(BigInteger.Compare(ref x, ref y));

I agree that if there were an easy way to hit one of these bugs, we would have hit it a long time ago. Still that makes it very challenging to reason about this code.

AntonLapounov · 2019-10-18T23:03:20Z

@ts2do Thank you for spotting these bugs. As @tannergooding mentioned, the best approach is to remove both problematic methods by inlining them into their callers and simplifying. Please let us know if you might help with that or prefer us to change the code.

ts2do · 2019-10-19T01:09:30Z

The static Add method overload is indeed being used directly by in src/System.Private.CoreLib/shared/System/Number.Dragon4.cs (which is invoked by FormatSingle and FormatDouble in some cases).
As for ShiftLeft, wouldn't it be better to fix the bug to simply support arbitrary shifting instead of limiting the shift parameter to 32? Without looking too far into it, ShiftLeft is used in Dragon4 with no apparent guarantee that the value will not exceed 32.

ts2do · 2019-10-19T01:55:21Z

I've made the requested changes.

I also have a more heavily modified version of BigInteger that I believe makes it 1) more consistent (e.g., results of static methods are always provided via out parameters), 2) safer (e.g., added Debug.Assert calls which ensure that the result argument for Multiply and DivRem does not share the same address as one of the operands), and 3) more correct (e.g., skip Buffer.Memcpy if rhs is the same address as this). I figured it would be prudent to offer critical bug fixes instead of offering it all at once and having them lost in the noise.

Here's a gist with all of the changes:
https://gist.github.com/ts2do/629f3bc1a92eb10be18c1898b6169094

A few more notes:
I removed the BigInteger(uint) and BigInteger(ulong) constructors, as benchmarks results compared against directly setting fields have me thinking the fixed array is being zeroed. They would be replaced by static void SetUInt32(out BigInteger result, uint value) and static void SetUInt32(out BigInteger result, ulong value) to avoid that behavior.
I added an AsString method to BigInteger to help me debug it a little.

AntonLapounov · 2019-10-19T03:08:24Z

@ts2do Note that Dragon4 uses different ShiftLeft and Add overloads. The two problematic overloads have only a single caller each, which are in the same Number.BigInteger.cs file. In particular, ShiftLeft is used for Pow2 only. We want to remove these problematic methods, not to fix them. There are more issues, even with your latest commit.

Inline Add(ref BigInteger, uint, ref BigInteger) into Add(uint) Inline ShiftLeft(ulong, uint, ref BigInteger) into Pow2 Inline ExtendBlock and ExtendBlocks into Pow2 Handle 0 in SetUInt32 and SetUInt64

ts2do · 2019-10-23T03:04:09Z

My mistake, I was searching some local changes that I was toying around with. Sorry for the delay, I've been having some trouble running tests against the changes (though I got it building). I plan to redo my setup with a fresh installs soon, so until then, I guess I'll use CI to test it?

AntonLapounov · 2019-10-23T19:37:02Z

@ts2do Thank you. Your changes look great in general. Reviewing now.

src/System.Private.CoreLib/shared/System/Number.BigInteger.cs

AntonLapounov · 2019-10-24T19:28:45Z

@tannergooding Don't we have a real product bug in Multiply(ref, uint, ref)? We clearly call that method with lhs ≠result in four places in Dragon4. For instance:

var x = new BigInteger(7);
var y = new BigInteger(0);
BigInteger.Multiply(ref x, 6, ref y);
// Expected value: 42, actual value: 0
Console.WriteLine(y.ToUInt64());

src/System.Private.CoreLib/shared/System/Number.BigInteger.cs

tannergooding · 2019-10-24T19:43:21Z

Don't we have a real product bug in Multiply(ref, uint, ref)

There is a bug, but I don't believe it is one that can repro in production. The usages in Number.Dragon4 are around scaledMarginLow and pScaledMarginHigh and we only call Multiply when pScaledMarginHigh != &scaledMarginLow. Due to how the code works, the length of pScaledMarginHigh will always be greater than or equal to scaledMarginLow and will either stay the same (no carry) or will grow by one (in which case we do update it -- this is due to us only multiply by 2, which is a left shift by 1).

AntonLapounov · 2019-10-24T20:12:02Z

the length of pScaledMarginHigh will always be greater than or equal to scaledMarginLow

@tannergooding What about this code path, where we multiply scaledMarginLow by a presumably big power of 10, which may make its length greater than the length of pScaledMarginHigh?

coreclr/src/System.Private.CoreLib/shared/System/Number.Dragon4.cs

Lines 222 to 226 in 5b1c001

    
           scaledMarginLow.Multiply(ref pow10); 
        
           if (pScaledMarginHigh != &scaledMarginLow) 
        
           { 
        
               BigInteger.Multiply(ref scaledMarginLow, 2, ref *pScaledMarginHigh);

tannergooding · 2019-10-24T20:45:06Z

What about this code path, where we multiply scaledMarginLow by a presumably big power of 10, which may make its length greater than the length of pScaledMarginHigh?

I'd need to check some math to determine if that is fine or not, but I believe it still works out due to how the margins exist and where they exist.

Still noting that this was ported from the native code and so the bug, if it exists, has been around basically forever.

AntonLapounov · 2019-10-28T23:10:24Z

@tannergooding I have tried

Dragon4Double(1.0 / (1L << 31), -1, true, ref buffer);

under a debugger and noticed that *pScaledMarginHigh was indeed calculated incorrectly as I anticipated. Namely, after multiplying by a power of 10, scaledMarginLow equals to 10⁹ and *pScaledMarginHigh must be 2×10⁹; however, its actual value was 2,820,130,816 due to not setting its _length field. Then Dragon4 used that incorrect value in a loop.

After fixing IsZero calculation, some tests are failing with:

Process terminated. Assertion failed.
   at System.Number.NumberToFloatingPointBitsSlow(NumberBuffer& number, FloatingPointInfo& info, UInt32 positiveExponent, UInt32 integerDigitsPresent, UInt32 fractionalDigitsPresent) in /_/src/System.Private.CoreLib/shared/System/Number.NumberToFloatingPointBits.cs:line 489
   at System.Number.NumberToFloatingPointBits(NumberBuffer& number, FloatingPointInfo& info) in /_/src/System.Private.CoreLib/shared/System/Number.NumberToFloatingPointBits.cs:line 414
   at System.Number.NumberToDouble(NumberBuffer& number) in /_/src/System.Private.CoreLib/shared/System/Number.Parsing.cs:line 1996

Would you be able to look at them?

AntonLapounov · 2019-10-29T00:20:41Z

It seems that before this fix Dragon4Double used to work incorrectly for 39% of the powers of two. Fortunately, we use this algorithm as a fallback only.

For instance, 2⁵⁵ is converted to 36028797018963968 before the fix and 36028797018963970 after the fix (one trailing digit shorter). 2⁹⁵⁷ is converted to 1.21816425142499988e+288 before the fix and 1.218164251425e+288 after the fix (five trailing digits shorter).

tannergooding · 2019-11-04T22:25:17Z

Looks like its failing because the fractionalNumerator is asserted to be non-zero: https://source.dot.net/#System.Private.CoreLib/shared/System/Number.NumberToFloatingPointBits.cs,487

This assertion "should" still hold true as we have at least 1 fractional digit present

tannergooding · 2019-11-05T19:37:00Z

Put up a PR for the fix here: #27688

maryamariyan · 2019-11-06T21:03:35Z

Thank you for your contribution. As announced in dotnet/coreclr#27549 this repository will be moving to dotnet/runtime on November 13. If you would like to continue working on this PR after this date, the easiest way to move the change to dotnet/runtime is:

In your coreclr repository clone, create patch by running git format-patch origin
In your runtime repository clone, apply the patch by running git apply --directory src/coreclr <path to the patch created in step 1>

tannergooding · 2019-11-07T21:07:40Z

@ts2do, if you rebase your changes ontop of (or merge with) the latest master, everything should pass now and we can get this merged 😄

ts2do · 2019-11-11T23:15:43Z

Everything should be all set.

AntonLapounov · 2019-11-11T23:37:11Z

@tsdo Nice work — thanks a lot!

* Method Add(ref BigInteger lhs, uint value, ref BigInteger result) would store most of the result blocks into lhs instead of result. * Method ShiftLeft(ulong input, uint shift, ref BigInteger output) with a shift argument exceeding 32 would generally compute the higher blocks incorrectly. * Multiply(ref BigInteger lhs, uint value, ref BigInteger result) would not set result._length in some cases. * IsZero() would incorrectly return false for non-canonical zeros with _length > 0. Fix: * Inline Add(ref BigInteger, uint, ref BigInteger) into Add(uint). * Inline ShiftLeft(ulong, uint, ref BigInteger) into Pow2. * Inline ExtendBlock and ExtendBlocks into Pow2. * Properly handle 0 in SetUInt32 and SetUInt64. Signed-off-by: dotnet-bot <dotnet-bot@microsoft.com>

jkotas requested a review from tannergooding October 18, 2019 02:30

AntonLapounov suggested changes Oct 18, 2019

View reviewed changes

tannergooding reviewed Oct 18, 2019

View reviewed changes

src/System.Private.CoreLib/shared/System/Number.BigInteger.cs Outdated Show resolved Hide resolved

BigInteger changes

148ffa2

ts2do added 2 commits October 22, 2019 21:48

BigInteger fixes

d22bc5f

Inline Add(ref BigInteger, uint, ref BigInteger) into Add(uint) Inline ShiftLeft(ulong, uint, ref BigInteger) into Pow2 Inline ExtendBlock and ExtendBlocks into Pow2 Handle 0 in SetUInt32 and SetUInt64

Merge branch 'master' into BigIntegerFix

7009b90

AntonLapounov reviewed Oct 23, 2019

View reviewed changes

src/System.Private.CoreLib/shared/System/Number.BigInteger.cs Outdated Show resolved Hide resolved

AntonLapounov reviewed Oct 23, 2019

View reviewed changes

src/System.Private.CoreLib/shared/System/Number.BigInteger.cs Outdated Show resolved Hide resolved

AntonLapounov reviewed Oct 23, 2019

View reviewed changes

src/System.Private.CoreLib/shared/System/Number.BigInteger.cs Outdated Show resolved Hide resolved

AntonLapounov reviewed Oct 24, 2019

View reviewed changes

src/System.Private.CoreLib/shared/System/Number.BigInteger.cs Show resolved Hide resolved

Requested changes to BigInteger

cb4b44b

AntonLapounov approved these changes Oct 28, 2019

View reviewed changes

jkotas added the area-System.Runtime label Nov 2, 2019

tannergooding mentioned this pull request Nov 5, 2019

Updating NumberToFloatingPointBitsSlow to handle the remaining parsed fractional digits being zero. #27688

Merged

Merge remote-tracking branch 'upstream/master' into BigIntegerFix

29b6af6

AntonLapounov merged commit 65a7947 into dotnet:master Nov 11, 2019

Conversation

ts2do commented Oct 18, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AntonLapounov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tannergooding commented Oct 18, 2019

Uh oh!

tannergooding commented Oct 18, 2019

Uh oh!

AntonLapounov commented Oct 18, 2019

Uh oh!

tannergooding commented Oct 18, 2019

Uh oh!

AntonLapounov commented Oct 18, 2019

Uh oh!

AntonLapounov commented Oct 18, 2019

Uh oh!

ts2do commented Oct 19, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ts2do commented Oct 19, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AntonLapounov commented Oct 19, 2019

Uh oh!

ts2do commented Oct 23, 2019

Uh oh!

AntonLapounov commented Oct 23, 2019

Uh oh!

Uh oh!

Uh oh!

Uh oh!

AntonLapounov commented Oct 24, 2019

Uh oh!

Uh oh!

tannergooding commented Oct 24, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AntonLapounov commented Oct 24, 2019

Uh oh!

tannergooding commented Oct 24, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AntonLapounov commented Oct 28, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

AntonLapounov commented Oct 29, 2019

Uh oh!

tannergooding commented Nov 4, 2019

Uh oh!

tannergooding commented Nov 5, 2019

Uh oh!

maryamariyan commented Nov 6, 2019

Uh oh!

tannergooding commented Nov 7, 2019

Uh oh!

ts2do commented Nov 11, 2019

Uh oh!

AntonLapounov commented Nov 11, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ts2do commented Oct 18, 2019 •

edited

Loading

ts2do commented Oct 19, 2019 •

edited

Loading

ts2do commented Oct 19, 2019 •

edited

Loading

tannergooding commented Oct 24, 2019 •

edited

Loading

tannergooding commented Oct 24, 2019 •

edited

Loading

AntonLapounov commented Oct 28, 2019 •

edited

Loading