fix Issue 13474 - Discard excess precision for float and double (x87) by WalterBright · Pull Request #6247 · dlang/dmd

WalterBright · 2016-11-07T11:00:30Z

Consider the following code:

 double foo(double x, double t, double s, double c) {
    double y = x - t;
    c += y + s;
    return s + c;
 }

When optimized, the body of the function becomes:

    return s + (c + ((x - t) + s));

Or, in x87 instructions:

       fld     qword ptr 01Ch[ESP]
       fld     qword ptr 0Ch[ESP]
       fxch    ST(1)
       fsub    qword ptr 014h[ESP]
       fadd    qword ptr 0Ch[ESP]
       fadd    qword ptr 4[ESP]
       fstp    qword ptr 4[ESP]
       fadd    qword ptr 4[ESP]
       ret     020h

The algorithm relies on the result of (x - t) being rounded to double precision. The only way to get the x87 to do that is to actually store the value to memory. But the compiler optimizes that store away, because it is substantially slower.
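
To illustrate why such rounding matters, here is a minimal sketch of an error-free addition (Knuth's TwoSum). It is not code from this pull request; the name `twoSum` and the test values are chosen for illustration only. The error term is only meaningful if every named intermediate is rounded to double. If `s` were kept at x87 extended precision, `1.0 + 1e-17` would not round to `1.0` and the recovered error would be wrong.

```d
/// Knuth's TwoSum: returns the rounded sum of a and b and writes the
/// rounding error of that addition to err.
/// Assumes every intermediate below is rounded to double precision.
double twoSum(double a, double b, out double err)
{
    double s  = a + b;   // rounded sum
    double bb = s - a;   // the part of b that made it into s
    double aa = s - bb;  // the part of a that made it into s
    double da = a - aa;  // what was lost from a
    double db = b - bb;  // what was lost from b
    err = da + db;
    return s;
}

unittest
{
    double err;
    double s = twoSum(1.0, 1e-17, err);
    assert(s == 1.0);      // 1e-17 is below half an ulp of 1.0
    assert(err == 1e-17);  // the lost addend is recovered exactly
}
```

The block can be exercised with `dmd -unittest -main -run` on a file containing it.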

The 64 bit code does not have this problem, because the code gen looks like:

       push    RBP
       mov     RBP,RSP
       movsd   XMM4,XMM0
       movsd   XMM5,XMM1
       subsd   XMM3,XMM2
       addsd   XMM3,XMM5
       addsd   XMM4,XMM3
       movsd   XMM0,XMM5
       addsd   XMM0,XMM4
       pop     RBP
       ret

It's doing the same optimization, but every intermediate result is rounded to double precision, because the scalar SSE2 instructions (subsd, addsd) operate on doubles held in the XMM registers.

Note that the following targets generate x87 code, not XMM code:

Win32, Linux32, FreeBSD32

because it is not guaranteed that the target has XMM registers. I suspect we don't really care about the floating point performance on those targets, but we do care that the code gives expected results.

The fix is to disable optimizing away the assignment to y on x87 code gen targets, so that the value is stored to memory and reloaded. The resulting code is:

       push    EAX
       push    EAX
       fld     qword ptr 024h[ESP]
       fsub    qword ptr 01Ch[ESP]
       fstp    qword ptr [ESP]         <== added store
       fld     qword ptr [ESP]          <== and reload
       fld     qword ptr 014h[ESP]
       fxch    ST(1)
       fadd    qword ptr 014h[ESP]
       fadd    qword ptr 0Ch[ESP]
       fstp    qword ptr 0Ch[ESP]
       fadd    qword ptr 0Ch[ESP]
       add     ESP,8
       ret     020h
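
As a sketch of the kind of code that depends on this behavior, here is a textbook Kahan compensated summation in D. It is an illustration, not code taken from this pull request or from Phobos; `kahanSum` and the test data are hypothetical. Each assignment to `y`, `t` and `comp` is assumed to discard any excess precision.

```d
/// Textbook Kahan summation. The compensation term is only correct if
/// y, t and comp are each rounded to double when assigned.
double kahanSum(const(double)[] values)
{
    double sum  = 0.0;
    double comp = 0.0;  // running compensation for lost low-order bits
    foreach (v; values)
    {
        double y = v - comp;    // apply the correction from the last step
        double t = sum + y;     // low-order bits of y may be lost here
        comp = (t - sum) - y;   // recover what was lost
        sum = t;
    }
    return sum;
}

unittest
{
    // Half an ulp of 1.0: a plain running sum in double precision
    // would round each of these additions back to 1.0.
    enum double halfUlp = double.epsilon / 2;
    const double[] values = [1.0, halfUlp, halfUlp, halfUlp, halfUlp];
    assert(kahanSum(values) == 1.0 + 2 * double.epsilon);
}
```

The four half-ulp addends total `2 * double.epsilon`, which the compensated sum recovers exactly when intermediates are rounded to double.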

dlang-bot · 2016-11-07T11:00:33Z

Fix	Bugzilla	Description
✓	13474	Discard excess precision for float and double (x87)

don-clugston-sociomantic · 2016-11-07T11:41:35Z

I'm very happy to see this. This addresses the biggest problem I've had with developing floating point code in D. Because this extra precision exists on some platforms but not others, I did not find any way of writing code which works for all three of x87 runtime, non-x87 runtime, and CTFE.

fix Issue 13474 - Discard excess precision for float and double (x87)

6db2246

WalterBright force-pushed the fix13474 branch from 35e4116 to 6db2246 on November 7, 2016 11:15

9il mentioned this pull request Nov 9, 2016

Add new function std.algorithm.iteration : cumulativeSum dlang/phobos#4881

Merged

andralex merged commit b9d6be2 into dlang:master Nov 10, 2016

WalterBright deleted the fix13474 branch November 10, 2016 11:18
