Fix gprod for integer64 by ben-schwen · Pull Request #5231 · Rdatatable/data.table

ben-schwen · 2021-10-21T17:21:49Z

Closes #5225.

codecov · 2021-10-21T17:28:31Z

Codecov Report

Merging #5231 (b4f8348) into master (e88826e) will increase coverage by 0.00%.
The diff coverage is 100.00%.

@@           Coverage Diff           @@
##           master    #5231   +/-   ##
=======================================
  Coverage   99.50%   99.50%           
=======================================
  Files          77       77           
  Lines       14590    14603   +13     
=======================================
+ Hits        14518    14531   +13     
  Misses         72       72

Impacted Files	Coverage Δ
src/gsumm.c	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update e88826e...b4f8348. Read the comment docs.

mattdowle · 2021-10-25T19:14:11Z

Great spot and fix. It's certainly debatable, but keeping the spirit of base::prod always returning double is how gforce prod is done for regular integer and I'm thinking that makes sense for integer64 too. Despite integer64 having its own prod method. That could change in bit64 in future possibly. Can't really explain it very well, but what I've changed it to feels right to me. It's testing against fixed y values that are a bit more independent of bit64, and there's a reason to do this in that we like and agree with (in this particular case) base::prod always returning double.
Ok with you?

MichaelChirico · 2021-10-25T19:26:04Z

not advocating one way or the other, but just noting that one difference of int vs int64 is that there are values of int64 that can't be represented exactly in double right? IIRC somewhere like 2^53 where doubles no longer have integer-level precision. whereas that's not true for int -- all valid int can be represented as double.

so it seems like returning int64 could me more correct in some cases, IIUC

ben-schwen · 2021-10-25T21:07:53Z

Perfectly fine for me. The only problem I see arising then is that different optimization levels will produce different outputs, see also my comment here.

library(bit64)
DT = data.table(x=c(lim.integer64(), 1, 1), g=1:2)
options(datatable.optimize=0L)
DT[, prod(x), g]
#>    g                   V1
#> 1: 1 -9223372036854775807
#> 2: 2  9223372036854775807
options(datatable.optimize=2L)
DT[, prod(x), g]
#>    g            V1
#> 1: 1 -9.223372e+18
#> 2: 2  9.223372e+18

My worries are not about losing precision but about the different types of the different optimization levels. This can become an even bigger problem for the user when certain functions turn off gforce optimization as e.g. DT[, c(prod(x)), g]

mattdowle · 2021-11-16T11:04:16Z

Great points. Ok returning integer64 it is then.

ben-schwen added 7 commits October 19, 2021 12:47

init fix gprod

0e08e61

finish bit64

cfbd9e3

Merge branch 'master' into gforce_gprod

0bdacdb

move free outside of cases

7562da1

add test

ae0975a

delete newline

2ad2554

add NEWS

15a6b5c

change fix to keep double being returned

609af68

mattdowle added this to the 1.14.3 milestone Oct 25, 2021

back to returning integer64

b4f8348

mattdowle merged commit df836b2 into master Nov 16, 2021

mattdowle deleted the gforce_gprod branch November 16, 2021 11:14

jangorecki modified the milestones: 1.14.9, 1.15.0 Oct 29, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix gprod for integer64#5231

Fix gprod for integer64#5231
mattdowle merged 9 commits intomasterfrom
gforce_gprod

ben-schwen commented Oct 21, 2021

Uh oh!

codecov bot commented Oct 21, 2021 •

edited

Loading

Uh oh!

mattdowle commented Oct 25, 2021

Uh oh!

MichaelChirico commented Oct 25, 2021

Uh oh!

ben-schwen commented Oct 25, 2021 •

edited

Loading

Uh oh!

mattdowle commented Nov 16, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

ben-schwen commented Oct 21, 2021

Uh oh!

codecov bot commented Oct 21, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

mattdowle commented Oct 25, 2021

Uh oh!

MichaelChirico commented Oct 25, 2021

Uh oh!

ben-schwen commented Oct 25, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mattdowle commented Nov 16, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov bot commented Oct 21, 2021 •

edited

Loading

ben-schwen commented Oct 25, 2021 •

edited

Loading