std.format: format floats and doubles with %g / %G / %s by berni44 · Pull Request #7804 · dlang/phobos

berni44 · 2021-02-18T17:06:46Z

There where mainly three things I had to do to implement %g:

Find the exact break points when to use %e and when to us %f
Remove trailing 0s and dot, when # is not present
Take care of the different meaning of precision (significant digits instead of fractional digits)
(oops, one more): Take care of 0.0.

As usual I ran lots of external tests comparing the current output with the new one: All floats; all combinations of flags/rounding mode/selection of number,width and precision; several billion random samples. Aside from known bugs in the old implementation no differences where found. (I also ran the tests for %e and %f to make sure I didn't mess up anything there.)

…with %g, %G and %s.

dlang-bot · 2021-02-18T17:06:49Z

Thanks for your pull request and interest in making D better, @berni44! We are looking forward to reviewing it, and you should be hearing from a maintainer soon.
Please verify that your PR follows this checklist:

My PR is fully covered with tests (you can see the coverage diff by visiting the details link of the codecov check)
My PR is as minimal as possible (smaller, focused PRs are easier to review than big ones)
I have provided a detailed rationale explaining my changes
New or modified functions have Ddoc comments (with Params: and Returns:)

Please see CONTRIBUTING.md for more information.

If you have addressed all reviews or aren't sure how to proceed, don't hesitate to ping us with a simple comment.

Bugzilla references

Auto-close	Bugzilla	Severity	Description
✗	20451	normal	comparing identical floating points does not work on Win32 and FreeBSD32.
✓	21641	minor	std.format: %g produces in rare circumstances inconsistent result

Testing this PR locally

If you don't have a local development environment setup, you can use Digger to test this PR:

dub run digger -- build "master + phobos#7804"

berni44 · 2021-02-18T19:21:59Z

I expected somewhat, that these unittests would fail on some processors, probably those with real==double. The floating point calculations in the switch in printFloatG have got rounding errors there, while not on other computers due to the internal use of (large enough) reals.

Finding a way around this, is difficult. The selection between %e and %f in snprintf is based on the decimal representation, not on the binary. That means, if we want to get around the floating point calculations, we would need some (expensive) BigInt calculations or a (large) lookup table. I dislike both, because it only applies in really, really rare occasions and the effort seems not to pay off here.

What happens in these situations is, that it prints the correct number, just the decision between scientific and "normal" representation is wrong (i. E. 1e-04 instead of 0.0001 or 0.0000999999 instead of 9.99999e-05).

One solution, although I do not know if you could accept it, would be to replace the unittest with a little bit more relaxed one, like this:

    double val = nextUp(0.00009999995);
    assert(printFloat(buf[], val, f) == "0.0001");
    val = nextDown(val);
    auto result = printFloat(buf[], val, f);
    assert(result == "0.0001" || result == "1e-04");
    val = nextDown(val);
    result = printFloat(buf[], val, f);
    assert(result == "9.99999e-05" || result == "0.0000999999");
    val = nextDown(val);
    assert(printFloat(buf[], val, f) == "9.99999e-05");

What do you think? Any ideas, how to get around this problem?

PS: Maybe it's possible to check the result with a regex and call the correct function in case the result is wrong... I'll have to sleep on it.

…t result

berni44 · 2021-02-20T20:33:08Z

To get around that problem, I replaced the calls to ^^ by a specialized function using ulongs. It fixes some really rare corner cases I overlooked before.

Unfortunately freebsd32 and win32 still fail on one unittest. After narrowing down the bug I ended up with a wrong floating point calculation:

unittest
{
        double a = 0.00009999995;
        double b = 0.00000000005;

        writefln!"%.80f"(a);
        writefln!"%.80f"(0.0001 - b);

        assert(!(a < 0.0001 - b));
}

The two writeflns show, that both values have identical bit patterns, but anyway the assert fires on the two 32-bit-computers.

Because fixing this bug is beyond this PR (*) I commented out the offending unittest.

(*) It would mean to add support for soft floats, or at least the parts, that are needed for the calculations in printFloatG.

PS: I think this is good to go now.

berni44 · 2021-02-24T07:06:27Z

@RazvanN7 Could you remove the WIP label please? I can't do so...

RazvanN7

I haven't thoroughly reviewed this, but there are some things that I noticed at a cursory glance:

please do not mix refactorings (even if they are small) with code additions. There are some places where a-b is transformed to a - b. Try to keep the diff as small as possible.
is there any way that this PR can be broken in several smaller PRs? It is very hard to review it as is. For example, is it possible to incrementally add the modifications to the printFloatE and printFloatF functions and then make a PR that adds printFloatG? That would make the reviewers job a lot easier.
Is there an alternative to using if(is_g) everywhere? This is effectively adding a lot of branches and might negatively impact performance. Normally, the format string should be known at compile time and therefore static ifs could be used, but I don't know exactly if this applies in this situation.

berni44 · 2021-02-24T13:07:39Z

please do not mix refactorings (even if they are small) with code additions. There are some places where a-b is transformed to a - b. Try to keep the diff as small as possible.

You're right. I probably did this, because I was asked to do so in the %f PR. Anyway, I'll open a separate PR for that.

is there any way that this PR can be broken in several smaller PRs?

Yes, I think so. But I'll start with adding printFloatG, because this is the part that is most problematic with regards to the content and I want to get this discussed first.

Is there an alternative to using if(is_g) everywhere? This is effectively adding a lot of branches and might negatively impact performance.

I avoided them inside of loops which means, the impact should be little.

Normally, the format string should be known at compile time and therefore static ifs could be used, but I don't know exactly if this applies in this situation.

Well that's not always true. But we do not need the format string here, to make the decision static, we only need to know if we are processing %e/%f or %g, which we do know.

I'm closing this, to keep it as a reference, and start new PRs.

RazvanN7 · 2021-02-24T13:10:50Z

Thanks @berni44 ! I look forward to the split PRs.

Partial replace call to snprintf for formating floatingpoint numbers …

15cb775

…with %g, %G and %s.

dlang-bot added the Severity:Bug Fix label Feb 18, 2021

berni44 changed the title ~~std.format: format floats and doubles with %g / %G / %s~~ [WIP] std.format: format floats and doubles with %g / %G / %s Feb 19, 2021

dlang-bot added the Review:WIP Work In Progress - not ready for review or pulling label Feb 19, 2021

berni44 force-pushed the printFloatG branch from 8698109 to 7e081e4 Compare February 20, 2021 10:05

berni44 added 3 commits February 20, 2021 15:03

Replace floating point math with own function based on ulongs

359e1db

Add unittests for CTFE and correct error message.

9931065

Fix 21641 - std.format: %g produces in rare circumstances inconsisten…

16b5da0

…t result

berni44 force-pushed the printFloatG branch 3 times, most recently from 6fe5710 to a9d5c90 Compare February 20, 2021 18:48

Comment out one unittest due to issue 20451.

da365b7

berni44 force-pushed the printFloatG branch from ab8a5c0 to da365b7 Compare February 20, 2021 19:49

berni44 changed the title ~~[WIP] std.format: format floats and doubles with %g / %G / %s~~ std.format: format floats and doubles with %g / %G / %s Feb 20, 2021

RazvanN7 removed the Review:WIP Work In Progress - not ready for review or pulling label Feb 24, 2021

RazvanN7 reviewed Feb 24, 2021

View reviewed changes

berni44 closed this Feb 24, 2021

This was referenced Mar 1, 2021

std.format: Add %g (step 3): Add template parameter to printFloatE/F. #7816

Merged

std.format: Add %g (step 4): Formatting 0.0 #7818

Merged

berni44 deleted the printFloatG branch April 28, 2021 17:39

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

std.format: format floats and doubles with %g / %G / %s#7804

std.format: format floats and doubles with %g / %G / %s#7804
berni44 wants to merge 5 commits intodlang:masterfrom
berni44:printFloatG

berni44 commented Feb 18, 2021

Uh oh!

dlang-bot commented Feb 18, 2021 •

edited

Loading

Uh oh!

berni44 commented Feb 18, 2021 •

edited

Loading

Uh oh!

berni44 commented Feb 20, 2021 •

edited

Loading

Uh oh!

berni44 commented Feb 24, 2021

Uh oh!

RazvanN7 left a comment •

edited

Loading

Uh oh!

berni44 commented Feb 24, 2021

Uh oh!

RazvanN7 commented Feb 24, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

berni44 commented Feb 18, 2021

Uh oh!

dlang-bot commented Feb 18, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Bugzilla references

Testing this PR locally

Uh oh!

berni44 commented Feb 18, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

berni44 commented Feb 20, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

berni44 commented Feb 24, 2021

Uh oh!

RazvanN7 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

berni44 commented Feb 24, 2021

Uh oh!

RazvanN7 commented Feb 24, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

dlang-bot commented Feb 18, 2021 •

edited

Loading

berni44 commented Feb 18, 2021 •

edited

Loading

berni44 commented Feb 20, 2021 •

edited

Loading

RazvanN7 left a comment •

edited

Loading