fix floating point parsing precision in some rare cases by MichaelChirico · Pull Request #4463 · Rdatatable/data.table

MichaelChirico · 2020-05-20T11:21:52Z

Solution actually easier than expected, h/t https://stackoverflow.com/a/4937591/3576984 for the inspiration.

codecov · 2020-05-20T11:34:26Z

Codecov Report

Merging #4463 (deb4db4) into master (9e6e453) will decrease coverage by 0.00%.
The diff coverage is 100.00%.

❗ Current head deb4db4 differs from pull request most recent head d120619. Consider uploading reports for the commit d120619 to get more accurate results

@@            Coverage Diff             @@
##           master    #4463      +/-   ##
==========================================
- Coverage   99.38%   99.38%   -0.01%     
==========================================
  Files          76       76              
  Lines       14490    14489       -1     
==========================================
- Hits        14401    14400       -1     
  Misses         89       89

Impacted Files	Coverage Δ
src/fread.c	`99.40% <100.00%> (-0.01%)`	⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9e6e453...d120619. Read the comment docs.

QuLogic · 2020-07-24T20:38:25Z

You could drop the lower half of the table (thus removing e+=300) and then do:

  // pow10lookup is always positive since sub-0 powers of 10 cannot
  //   be represented exactly, which can lead to (rare) precision errors, #4461
  r = e < 0 ? r/pow10lookup[300 - e] : r*pow10lookup[e];

(may be off-by-one)

… it was distracting to see the number as a column name too

mattdowle · 2021-08-27T14:24:25Z

I checked that the new test failed for me also before this PR.

mattdowle · 2021-08-27T17:01:05Z

Interesting that the other 12 out of 100,000 also straddle by 0.0...6 too. Building on your example:

> for (i in 0:99999) {
  s = sprintf("0.80606%05d", i)
  r = eval(parse(text=s))
  f = fread(text=s)$V1
  if (!identical(r, f))
    cat(s, sprintf("%1.17f", r), sprintf("%1.17f", f), "\n")
}
0.8060603509 0.80606035089999994 0.80606035090000006 
0.8060614740 0.80606147399999994 0.80606147400000006 
0.8060623757 0.80606237569999994 0.80606237570000006 
0.8060629084 0.80606290839999994 0.80606290840000006 
0.8060632774 0.80606327739999994 0.80606327740000006 
0.8060638101 0.80606381009999994 0.80606381010000006 
0.8060647118 0.80606471179999994 0.80606471180000006 
0.8060658349 0.80606583489999994 0.80606583490000006 
0.8060667366 0.80606673659999994 0.80606673660000006   # Gabe's pick
0.8060672693 0.80606726929999994 0.80606726930000006 
0.8060676383 0.80606763829999994 0.80606763830000006 
0.8060681710 0.80606817099999994 0.80606817100000006 
0.8060690727 0.80606907269999994 0.80606907270000006

mattdowle · 2021-08-27T17:58:34Z

@MichaelChirico better?

MichaelChirico · 2021-08-27T19:19:58Z

perfect! much improved.

fix floating point parsing precision in some rare cases

fce8610

mattdowle added this to the 1.14.1 milestone Aug 27, 2021

mattdowle added 3 commits August 26, 2021 19:15

merge master

2b60945

clearer news item for folk scanning news quickly, and simpler test as…

d19629c

… it was distracting to see the number as a column name too

drop lower half of lookup table as @QuLogic suggested

d120619

mattdowle merged commit f1a4072 into master Aug 27, 2021

mattdowle deleted the fread-precision branch August 27, 2021 14:33

mattdowle added a commit that referenced this pull request Aug 27, 2021

news-only: more detail for #4463

6b0c45b

mattdowle added a commit that referenced this pull request Aug 27, 2021

news-only: tweak news item detail for #4463

490d460

tlapak mentioned this pull request Mar 12, 2022

Obtaining different object from same file #5346

Open

jangorecki modified the milestones: 1.14.9, 1.15.0 Oct 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix floating point parsing precision in some rare cases#4463

fix floating point parsing precision in some rare cases#4463
mattdowle merged 4 commits intomasterfrom
fread-precision

MichaelChirico commented May 20, 2020

Uh oh!

codecov bot commented May 20, 2020 •

edited

Loading

Uh oh!

QuLogic commented Jul 24, 2020 •

edited

Loading

Uh oh!

mattdowle commented Aug 27, 2021

Uh oh!

mattdowle commented Aug 27, 2021 •

edited

Loading

Uh oh!

mattdowle commented Aug 27, 2021

Uh oh!

MichaelChirico commented Aug 27, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

MichaelChirico commented May 20, 2020

Uh oh!

codecov bot commented May 20, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

QuLogic commented Jul 24, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattdowle commented Aug 27, 2021

Uh oh!

mattdowle commented Aug 27, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattdowle commented Aug 27, 2021

Uh oh!

MichaelChirico commented Aug 27, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov bot commented May 20, 2020 •

edited

Loading

QuLogic commented Jul 24, 2020 •

edited

Loading

mattdowle commented Aug 27, 2021 •

edited

Loading