-
Notifications
You must be signed in to change notification settings - Fork 4k
ARROW-9054: [C++] Add ScalarAggregateOptions #9758
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
|
As per comments in Jiras this should introduce the following:
|
d210140 to
97c78d6
Compare
c71b1f8 to
6e89d74
Compare
|
Should MinMax be updated to use ScalarAggregateOptions? You should also be able to remove https://github.com/apache/arrow/blob/master/r/R/compute.R#L113-L117 and search for any other references to that jira number in the R code/tests. |
I'll look into it. Count would be a candidate as well.
Will do. |
|
This is slowly coming together. Remaining todo:
|
Is this desired behaviour? I think in general the resulting data type should be known based on the input types and operation (and not depend on the values) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just to make sure I am reading this one correctly, is it basically testing mean([], min_count=0) == 0 ?
I am not sure that should give 0 as result? I would have expected either NaN or Null.
Basically mean of empty is dividing 0/0, which is NaN.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Agreed. I think I've put it there to have it documented while I was changing other things.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It seems you changed it to return null now? Should it rather return NaN? (I assume it could be consistent with a manual mean from sum/count with min_count=0)
|
Looking forward to this! Added a few comments on naming and exact expected semantics. |
85e8bf0 to
1478720
Compare
|
This requires some more work on C Glib but is ready for review otherwise. |
|
@jorisvandenbossche & @nealrichardson could you take a look at this proposal? |
|
Should we introduce |
40f5e49 to
ce22f1a
Compare
Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
* Fix typos * Improve styles * Add more tests
|
Some checks timed out. Rebased to restart the tasks. |
|
The "R / AMD64 Windows RTools 35" failure was occurred on master too: https://github.com/apache/arrow/runs/2634288057 It's unrelated to this. I'll merge this. |
|
Thanks for the help everyone! |
ARROW-9054.
This adds
ScalarAggregateOptionsto control null behavior ofmeanandsumkernels.