Decimal multiply kernel should not cause precision loss #5980

viirya · 2023-04-12T20:00:56Z

Which issue does this PR close?

Closes #5674.
Closes #3387.
Closes #4024.

Rationale for this change

Currently, decimal multiplication in DataFusion silently truncates the precision of the result. This happens even for regular decimal multiplication that doesn't overflow. It looks like DataFusion uses an incomplete version of Spark's decimal precision coercion rule to coerce the two sides of a decimal multiplication (and of other arithmetic operators). The coerced type of the two sides is not the result decimal type of the multiplication. This, together with how we compute decimal multiplication in the kernels, leads to truncated precision in the result decimal type.
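To make the kind of loss described here concrete, the following is a minimal std-only sketch. It does not reproduce DataFusion's kernels; both function names are hypothetical, and the "lossy" variant simply assumes the product is forced back to the operands' scale.

```rust
/// Multiply two unscaled decimal values that share the same scale.
/// The exact product carries scale `2 * scale`, so no digits are lost.
fn multiply_exact(a: i128, b: i128) -> i128 {
    a * b
}

/// Hypothetical lossy variant: the product is rescaled back to the
/// operands' scale, which drops the low-order fractional digits.
fn multiply_keep_operand_scale(a: i128, b: i128, scale: u32) -> i128 {
    (a * b) / 10i128.pow(scale)
}
```

With 1.23 and 4.56 stored as 123 and 456 at scale 2, the exact product is 56088 at scale 4 (5.6088), while the lossy variant yields 560 at scale 2 (5.60).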

What changes are included in this PR?

- Moved decimal type coercion for math binary operators from TypeCoercion to the physical binary operator
- Fixed the type coercion rule for decimals
  - Produced correct coerced types
  - Separated result type from coerced type
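
As an illustration of why the result type has to be tracked separately from the coerced operand type, here is a sketch of the Spark-style result type for decimal multiplication (precision `p1 + p2 + 1`, scale `s1 + s2`). The function name and the simple cap at 38 are assumptions for this example; Spark's own adjustment when the precision exceeds the maximum is more involved and is not modelled here, and this is not the code in this PR.

```rust
/// Maximum precision of a 128-bit decimal.
const MAX_PRECISION: u8 = 38;
/// Assumed cap on the scale for this sketch.
const MAX_SCALE: i8 = 38;

/// Hypothetical helper: Spark-style result type of `Decimal(p1, s1) * Decimal(p2, s2)`.
/// Inputs are assumed to be valid decimal types (precision <= 38).
fn multiply_result_type(p1: u8, s1: i8, p2: u8, s2: i8) -> (u8, i8) {
    let precision = p1 + p2 + 1;
    let scale = s1 + s2;
    // Simplified cap; it does not trade scale for integer digits.
    (precision.min(MAX_PRECISION), scale.min(MAX_SCALE))
}
```

For two `Decimal128(10, 2)` operands this gives `(21, 4)`, which is wider than either operand type.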

Are these changes tested?

Are there any user-facing changes?

viirya · 2023-04-12T20:07:40Z

Unlike #5675, this doesn't add a new expression node PromotePrecision; instead it defers decimal type coercion to the math expression evaluation phase. This approach is closer to how Spark handles decimal math coercion nowadays.

viirya · 2023-04-16T21:42:04Z

There is a compilation error. Going to fix it in #6029.

viirya · 2023-04-17T19:05:14Z

datafusion/physical-expr/src/expressions/binary.rs

+                    Some(99193548387), // 0.99193548387
+                    None,
+                    None,
+                    Some(100813008130), // 1.0081300813
+                    Some(100000000000), // 1.0
+                ],
+                21,
+                11,


Previously, this division lost precision. Now we get it back.
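
For reference, a std-only sketch of the Spark-style division rule (scale `max(6, s1 + p2 + 1)`, precision `p1 - s1 + s2 + scale`) and of integer division carried out at that scale. Both function names are hypothetical, the operands 123 and 124 are chosen only so that the quotients match the values in the diff above, and the truncating division is an assumption (a real kernel may round differently).

```rust
/// Hypothetical helper: Spark-style result type of `Decimal(p1, s1) / Decimal(p2, s2)`,
/// with a simplified clamp to the 128-bit decimal limits.
fn divide_result_type(p1: u8, s1: i8, p2: u8, s2: i8) -> (u8, i8) {
    let (p1, s1, p2, s2) = (p1 as i16, s1 as i16, p2 as i16, s2 as i16);
    let scale = (s1 + p2 + 1).max(6);
    let precision = p1 - s1 + s2 + scale;
    (precision.clamp(1, 38) as u8, scale.min(38) as i8)
}

/// Divide two scale-0 values and return the unscaled quotient at `result_scale`.
/// Returns `None` for division by zero; truncates toward zero.
fn divide_to_scale(a: i128, b: i128, result_scale: u32) -> Option<i128> {
    if b == 0 {
        return None;
    }
    Some(a * 10i128.pow(result_scale) / b)
}
```

Assuming both operands are `Decimal128(10, 0)`, the rule yields `(21, 11)`, which is consistent with the precision and scale in the expected array above.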

viirya · 2023-04-17T19:16:26Z

datafusion/physical-expr/src/expressions/binary.rs

        // subtract: decimal array subtract int32 array
        let schema = Arc::new(Schema::new(vec![
-            Field::new("b", DataType::Int32, true),
            Field::new("a", DataType::Decimal128(10, 2), true),


Previously the field order was incorrect. But since we coerced the types on both sides of the op anyway, it still worked. Now we don't coerce the decimal field (which was wrongly bound to an Int32Array) before passing it into the binary expression, so the wrong field order causes an error.

viirya · 2023-04-17T19:16:59Z

datafusion/core/tests/sqllogictests/test_files/tpch.slt

    sum(l_extendedprice) as sum_base_price,
    sum(l_extendedprice * (1 - l_discount)) as sum_disc_price,
-    sum(l_extendedprice * (1 - l_discount) * (1 + l_tax)) as sum_charge,
+    sum(cast(l_extendedprice as decimal(12,2)) * (1 - l_discount) * (1 + l_tax)) as sum_charge,


See https://github.com/apache/arrow-datafusion/pull/5675/files#r1148798281

viirya · 2023-04-17T20:03:49Z

benchmarks/queries/q8.sql

+    cast(cast(sum(case
+                      when nation = 'BRAZIL' then volume
+                      else 0
+        end) as decimal(12,2)) / cast(sum(volume) as decimal(12,2)) as decimal(15,2)) as mkt_share


See https://github.com/apache/arrow-datafusion/pull/5675/files#r1152896889

viirya · 2023-04-17T20:04:29Z

datafusion/core/tests/sqllogictests/src/engines/conversion.rs

+pub fn i128_to_str(value: i128, precision: &u8, scale: &i8) -> String {
    big_decimal_to_str(
-        BigDecimal::from_str(&Decimal::from_i128_with_scale(value, scale).to_string())
+        BigDecimal::from_str(&Decimal128Type::format_decimal(value, *precision, *scale))


See https://github.com/apache/arrow-datafusion/pull/5675/files#r1148798935
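
As a rough illustration of what rendering an unscaled `i128` with a scale involves, here is a std-only sketch. The helper name is hypothetical and this is not the implementation of `Decimal128Type::format_decimal`; in particular it ignores the precision argument and its handling of non-positive scales is an assumption.

```rust
/// Hypothetical helper: render an unscaled i128 as a decimal string.
fn format_unscaled(value: i128, scale: i8) -> String {
    let negative = value < 0;
    let digits = value.unsigned_abs().to_string();
    let body = if scale <= 0 {
        // Non-positive scale: the value is a whole number, so append zeros.
        if value == 0 {
            digits
        } else {
            format!("{}{}", digits, "0".repeat(scale.unsigned_abs() as usize))
        }
    } else {
        let scale = scale as usize;
        // Left-pad with zeros so at least one digit precedes the decimal point.
        let padded = format!("{:0>width$}", digits, width = scale + 1);
        let (int_part, frac_part) = padded.split_at(padded.len() - scale);
        format!("{}.{}", int_part, frac_part)
    };
    if negative {
        format!("-{}", body)
    } else {
        body
    }
}
```

For example, 99193548387 at scale 11 renders as `0.99193548387`.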

viirya · 2023-04-17T21:38:03Z

This deals with the decimal precision issue without an additional PromotePrecision node (#5675).

cc @alamb @liukun4515

Dandandan · 2023-04-18T07:36:53Z

I wonder if this already fixes #4024

viirya · 2023-04-18T07:53:38Z

> I wonder if this already fixes #4024

Yea, just verified locally that this can pass verify_q6.

Dandandan

Looks great!

viirya · 2023-04-18T18:06:09Z

Thanks @Dandandan

Dandandan · 2023-04-19T07:57:04Z

Let's wait ~24hrs so other reviewers can have a chance.

Dandandan · 2023-04-19T08:01:45Z

FYI @mingmwang @andygrove this PR also has some effect on performance, as casting is changed (mostly reduced).

Dandandan · 2023-04-19T08:28:47Z

Ran the benchmarks for TPCH(SF=1) in memory.

Performance is mostly the same, except a ~30% improvement for q1 compared to main 🚀

alamb · 2023-04-24T17:12:47Z

🎉

viirya marked this pull request as draft April 12, 2023 20:01

github-actions bot added logical-expr Logical plan and expressions optimizer Optimizer rules physical-expr Changes to the physical-expr crates labels Apr 12, 2023

viirya mentioned this pull request Apr 12, 2023

Decimal multiply kernel should not cause precision loss #5675

Closed

github-actions bot added core Core DataFusion crate sqllogictest SQL Logic Tests (.slt) labels Apr 12, 2023

viirya force-pushed the fix_decimal_multiply_precision_loss4 branch 2 times, most recently from 54397f9 to 343ca79 Compare April 13, 2023 21:34

viirya added 9 commits April 16, 2023 18:17

Init

a93317a

More

92b4c65

More

650f7c5

More

6374d34

Fix

850b136

fix

469fbb6

More

27b3db0

Fix

9ff962f

Add query comment

0a88516

viirya force-pushed the fix_decimal_multiply_precision_loss4 branch from cb7e326 to 0a88516 Compare April 17, 2023 01:18

viirya added 5 commits April 16, 2023 19:42

Update expected plans

ce7a649

Fix

78e4c77

Fix clippy

15f01db

Fix

f459932

Fix

7d42e01

viirya commented Apr 17, 2023

Fix

854395f

viirya commented Apr 17, 2023

Fix

fc175c0

viirya marked this pull request as ready for review April 17, 2023 21:35

Enable verify_q6 test

9585a12

Dandandan approved these changes Apr 18, 2023

Dandandan merged commit e81f54b into apache:main Apr 20, 2023

mingmwang mentioned this pull request May 8, 2023

Significant performance downgrade to tpch-q1 #6278

Closed
