Use Double.NEGATIVE_INFINITY and Double.POSITIVE_INFINITY by leventov · Pull Request #4496 · apache/druid

leventov · 2017-07-01T00:01:01Z

Use these instead of Double.MIN_VALUE and Double.MAX_VALUE; likewise for Float.

Double.MIN_VALUE is the smallest positive non-zero double value, not a "very negative" value.

One effect, for example, is that DoubleMaxAggregator with an expression column apparently could never return a negative value.
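To make the pitfall concrete, here is a minimal sketch of a running maximum seeded in two different ways. `MaxSeedDemo` and `maxWithSeed` are hypothetical names made up for this illustration; this is not Druid's aggregator code, only the same seeding pattern in isolation.

```java
class MaxSeedDemo {
    // Folds the values with Math.max, starting from the given seed.
    // The seed plays the role of an aggregator's initial value.
    static double maxWithSeed(double seed, double... values) {
        double acc = seed;
        for (double v : values) {
            acc = Math.max(acc, v);
        }
        return acc;
    }

    public static void main(String[] args) {
        double[] allNegative = {-3.0, -1.5, -42.0};

        // Double.MIN_VALUE is a tiny positive number, so it beats every negative input.
        System.out.println(Double.MIN_VALUE > 0);                                // true
        System.out.println(maxWithSeed(Double.MIN_VALUE, allNegative));          // 4.9E-324

        // Negative infinity is below every other double, so the real maximum survives.
        System.out.println(maxWithSeed(Double.NEGATIVE_INFINITY, allNegative));  // -1.5
    }
}
```

With the Double.MIN_VALUE seed, the result for an all-negative input is the seed itself rather than any of the inputs, which matches the symptom described above.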

~~Also there is a bug~~ in RTree where maxCoords are filled with Float.MAX_VALUE, and opposite for minCoords.

jihoonson · 2017-07-04T23:39:34Z

      <property name="message" value="Use Comparators.naturalNullsFirst() instead of Ordering.natural().nullsFirst()"/>
    </module>
+
+    <module name="Regexp">


Do these patterns need to be prohibited? I'm not sure.

I think yes, because they are 1) never needed in the codebase so far, and 2) bug-prone: people use them where the Infinity constants are what they actually need.

jihoonson · 2017-07-04T23:40:20Z


        mergedPositions[currentIndex] = mm0;
-        //mergedPositions[nextIndex] = Float.MAX_VALUE; // for debugging
+        //mergedPositions[nextIndex] = Float.POSITIVE_INFINITY; // for debugging


Looks like this should be removed.

jihoonson · 2017-07-04T23:40:27Z

          heapSize = heapDelete(heap, reverseIndex, heapSize, reverseIndex[currentIndex], deltas);

-          //deltas[currentIndex] = Float.MAX_VALUE; // for debugging
+          //deltas[currentIndex] = Float.POSITIVE_INFINITY; // for debugging


Looks like this should be removed.

jihoonson · 2017-07-04T23:40:32Z


        // mark the merged bin as invalid
-        // deltas[nextIndex] = Float.MAX_VALUE; // for debugging
+        // deltas[nextIndex] = Float.POSITIVE_INFINITY; // for debugging


Looks like this should be removed.

gianm

Looks good, but would appreciate more unit tests verifying that the behavior is good now (such as for the doubleMin/doubleMax aggregators). Especially to verify that all the code paths involved do deal well with infinities.

gianm · 2017-07-06T15:08:49Z

@leventov do you have any thoughts on my comment about tests? This is one of the last patches blocking 0.10.1-rc2 so I'm hoping we can resolve it quickly.

drcrallen · 2017-07-06T15:27:00Z

Also, the JSON serde needs to make sure it handles positive and negative infinity values as well.
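As background for this concern: the JSON number grammar (RFC 8259) has no literal for infinity, while Java's own text form of an infinite double is `Infinity` or `-Infinity`. The sketch below uses only the JDK; `InfinityTextDemo` and `isJsonNumber` are hypothetical names, and how the serde in question actually treats these values is not shown in this thread.

```java
import java.util.regex.Pattern;

class InfinityTextDemo {
    // Number grammar from RFC 8259: optional minus, integer part,
    // optional fraction, optional exponent.
    private static final Pattern JSON_NUMBER =
        Pattern.compile("-?(0|[1-9]\\d*)(\\.\\d+)?([eE][+-]?\\d+)?");

    static boolean isJsonNumber(String text) {
        return JSON_NUMBER.matcher(text).matches();
    }

    public static void main(String[] args) {
        String finite = Double.toString(1.5);
        String posInf = Double.toString(Double.POSITIVE_INFINITY);
        String negInf = Double.toString(Double.NEGATIVE_INFINITY);

        System.out.println(finite + " valid JSON number: " + isJsonNumber(finite));
        System.out.println(posInf + " valid JSON number: " + isJsonNumber(posInf));
        System.out.println(negInf + " valid JSON number: " + isJsonNumber(negInf));

        // Java itself can parse its own text form back to the same value.
        System.out.println(Double.parseDouble(posInf) == Double.POSITIVE_INFINITY);
    }
}
```

So a writer that emits the bare Java text form produces something a strict JSON parser may reject, which is why a round-trip test through the serde is worth having.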

leventov · 2017-07-06T15:57:52Z

@gianm I have another patch I want to include into 0.10.1, not published yet

gianm · 2017-07-07T00:40:13Z

Thanks for adding some tests. I would prefer one for doubleMin/doubleMax too but will not consider that blocking to the PR. Please resolve conflicts too and this looks good to me.

leventov · 2017-07-07T00:41:09Z

@gianm the doubleMin/doubleMax bug probably can't be exploited right now. It requires an EvalExpr of null converted from a float column, and I didn't find a way to produce that without Calcite's type checker complaining.

b-slim · 2017-07-07T15:01:24Z

+      <property name="message" value="Use Float.POSITIVE_INFINITY"/>
+    </module>
+    <module name="Regexp">
+      <property name="format" value="Float\.MIN_VALUE"/>


Not sure how this checkstyle rule will work. Does this mean any usage of Float.MIN_VALUE is prohibited?

Yes; see #4496 (comment). I guess if anyone really needs it, the checkstyle rule could be removed or altered.
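To illustrate what "any usage" means for a regular-expression rule, here is a small sketch that applies the same expression as the `format` property quoted above to a few source lines. It assumes the check is a plain text search over the file; `ForbiddenConstantDemo` and `isFlagged` are hypothetical names, and checkstyle itself is not invoked here.

```java
import java.util.regex.Pattern;

class ForbiddenConstantDemo {
    // Same regular expression as the "format" property in the rule above.
    private static final Pattern FLOAT_MIN_VALUE = Pattern.compile("Float\\.MIN_VALUE");

    // Returns true if the line contains the forbidden text anywhere.
    static boolean isFlagged(String sourceLine) {
        return FLOAT_MIN_VALUE.matcher(sourceLine).find();
    }

    public static void main(String[] args) {
        System.out.println(isFlagged("float lo = Float.MIN_VALUE;"));         // true
        System.out.println(isFlagged("// seed with Float.MIN_VALUE"));        // true
        System.out.println(isFlagged("float lo = Float.NEGATIVE_INFINITY;")); // false
    }
}
```

Under that assumption the rule also fires on comments, which fits the "Replace usages in comments" commit in this PR.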

* Use Double.NEGATIVE_INFINITY and Double.POSITIVE_INFINITY instead of Double.MIN_VALUE and Double.MAX_VALUE, same for Float
* Replace usages in comments
* Fix RTree
* Remove commented code
* Add tests

b-slim · 2017-07-07T15:24:53Z

    float[] initMaxCoords = new float[numDims];
-    Arrays.fill(initMinCoords, -Float.MAX_VALUE);
-    Arrays.fill(initMaxCoords, Float.MAX_VALUE);
+    Arrays.fill(initMinCoords, Float.NEGATIVE_INFINITY);


I am not familiar with the RTree code, but I am not sure this change really makes sense. After this change, any operation on the root node's coordinates will always return infinity, whereas before I could, for instance, apply an incremental change to the root coordinates.
E.g., node.getMinCoordinates()[0] [* or /] 100 will return Infinity if we use Float.NEGATIVE_INFINITY, as opposed to a defined value if we keep Float.MAX_VALUE.

I think it should be OK. I didn't check all the code around RTree in Druid, but usually the min/max coordinates of an RTree node represent the minimum range covering all coordinates of its child nodes. That range is used for early pruning when traversing the tree. So I think the min/max coordinates of the root node shouldn't be used for any computation.
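The arithmetic in question can be checked in isolation. The following sketch does not reproduce Druid's RTree code; `SentinelArithmeticDemo`, `scale`, and `liesWithin` are hypothetical helpers that only show how the two kinds of sentinel behave under scaling and under the range comparisons that pruning relies on.

```java
class SentinelArithmeticDemo {
    // Multiplies a coordinate by a factor using float arithmetic.
    static float scale(float coord, float factor) {
        return coord * factor;
    }

    // Range test of the kind used when deciding whether a node may contain a value.
    static boolean liesWithin(float value, float min, float max) {
        return min <= value && value <= max;
    }

    public static void main(String[] args) {
        // Dividing the finite sentinel by 100 stays finite...
        System.out.println(Float.isInfinite(scale(Float.MAX_VALUE, 0.01f)));
        // ...but multiplying it by 100 overflows to infinity as well.
        System.out.println(Float.isInfinite(scale(Float.MAX_VALUE, 100f)));
        // The infinite sentinel stays infinite under either operation.
        System.out.println(Float.isInfinite(scale(Float.POSITIVE_INFINITY, 0.01f)));

        // Comparisons against infinite bounds are well defined for every finite value.
        System.out.println(liesWithin(123.4f, Float.NEGATIVE_INFINITY, Float.POSITIVE_INFINITY));
        System.out.println(liesWithin(Float.MAX_VALUE, Float.NEGATIVE_INFINITY, Float.POSITIVE_INFINITY));
    }
}
```

If the root bounds are only ever compared against, as the reply suggests, the infinite sentinels behave like an unbounded range; they would matter only in code that does arithmetic on them.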

b-slim · 2017-07-07T15:34:13Z

@gianm was still looking to review this PR...

b-slim · 2017-07-07T15:35:53Z

      if (ratio >= 1) {
        // handle very unlikely case that value is > 2^64
-        return Double.MAX_VALUE;
+        return Double.POSITIVE_INFINITY;


Same as the comment above: why does this need to be +infinity and not just Double.MAX_VALUE?

gianm · 2017-07-07T15:38:37Z

@gianm was still looking to review this PR...

Ah, sorry, didn't realize that. I'd say keep reviewing and if you uncover anything that needs to change let's get it in to 0.10.1.

Use Double.NEGATIVE_INFINITY and Double.POSITIVE_INFINITY instead of …

3ea542f

…Double.MIN_VALUE and Double.MAX_VALUE, same for Float

leventov added Bug WIP labels Jul 1, 2017

leventov added this to the 0.10.1 milestone Jul 1, 2017

leventov added 2 commits June 30, 2017 20:05

Replace usages in comments

963b245

Fix RTree

53b4d27

leventov removed the WIP label Jul 1, 2017

jihoonson reviewed Jul 4, 2017

Remove commented code

9719b56

gianm reviewed Jul 5, 2017

leventov added 2 commits July 6, 2017 19:36

Add tests

db0cfc2

Merge remote-tracking branch 'upstream/master' into double-min-max-va…

fd8e343

…lue-bugs

b-slim reviewed Jul 7, 2017

gianm approved these changes Jul 7, 2017

gianm merged commit d168a42 into apache:master Jul 7, 2017

gianm mentioned this pull request Jul 7, 2017

[Backport] Use Double.NEGATIVE_INFINITY and Double.POSITIVE_INFINITY #4518

Merged

b-slim reviewed Jul 7, 2017

leventov deleted the double-min-max-value-bugs branch July 7, 2017 17:20
