perf: Optimize trunc scalar performance by kumarUjjawal · Pull Request #19788 · apache/datafusion

kumarUjjawal · 2026-01-13T12:12:27Z

Which issue does this PR close?

Part of [EPIC] Optimize performance for slow expressions datafusion-comet#2986.

Rationale for this change

The current trunc implementation always converts scalar inputs to arrays via make_scalar_function, which introduces unnecessary overhead when processing single values.

What changes are included in this PR?

- Add a scalar fast path for the trunc function to process Float32/Float64 scalar inputs directly
- Handle the optional precision argument for scalar inputs
- Add scalar benchmarks to measure performance
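
To make the idea of a scalar fast path concrete, here is a minimal sketch that truncates a single value directly instead of wrapping it in an array first. It is not the code from this PR: `Scalar` and `trunc_scalar` are hypothetical stand-ins for DataFusion's `ScalarValue` and the function body, and the sketch scales with `trunc()`, whereas the helper quoted later in the review used `round()`.

```rust
/// Hypothetical stand-in for DataFusion's `ScalarValue`; only the two
/// float variants discussed in this PR are modeled.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Scalar {
    Float32(Option<f32>),
    Float64(Option<f64>),
}

/// Truncate one scalar to `precision` decimal places without building an
/// array. A missing precision defaults to 0, and a NULL input stays NULL
/// with the same float type.
fn trunc_scalar(value: Scalar, precision: Option<i64>) -> Scalar {
    let p = precision.unwrap_or(0) as i32;
    match value {
        Scalar::Float64(v) => Scalar::Float64(v.map(|x| {
            let factor = 10.0_f64.powi(p);
            (x * factor).trunc() / factor
        })),
        Scalar::Float32(v) => Scalar::Float32(v.map(|x| {
            let factor = 10.0_f32.powi(p);
            (x * factor).trunc() / factor
        })),
    }
}
```

Under these assumptions, `trunc_scalar(Scalar::Float64(Some(3.76)), None)` yields `Scalar::Float64(Some(3.0))`, and a `Float32(None)` input comes back as `Float32(None)`.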

Are these changes tested?

Yes, all sqllogictest tests pass.

Benchmark Results

| Type | Before | After | Speedup |
|------|--------|-------|---------|
| f64 scalar | 256 ns | 55 ns | 4.6x |
| f32 scalar | 247 ns | 56 ns | 4.4x |

Are there any user-facing changes?

No

martin-g · 2026-01-13T12:56:56Z

+                match &args.args[1] {
+                    ColumnarValue::Scalar(Int64(Some(p))) => *p,
+                    ColumnarValue::Scalar(Int64(None)) => {
+                        return Ok(ColumnarValue::Scalar(ScalarValue::Float64(None)));


I think this should check the input scalar's type to decide whether to return a Float64 or a Float32 null.
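
As an illustration of this suggestion, the NULL result can be derived from the input's variant so that a Float32 input never produces a Float64 NULL. This is a sketch only: `Scalar` and `null_like` are hypothetical names, not DataFusion APIs.

```rust
/// Hypothetical stand-in for DataFusion's `ScalarValue`.
#[derive(Debug, Clone, Copy, PartialEq)]
enum Scalar {
    Float32(Option<f32>),
    Float64(Option<f64>),
}

/// Build a NULL result whose float type follows the input's type.
fn null_like(input: Scalar) -> Scalar {
    match input {
        Scalar::Float32(_) => Scalar::Float32(None),
        Scalar::Float64(_) => Scalar::Float64(None),
    }
}
```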

martin-g · 2026-01-13T13:04:08Z


 fn compute_truncate32(x: f32, y: i64) -> f32 {
    let factor = 10.0_f32.powi(y as i32);
    (x * factor).round() / factor


Not introduced in this PR, but why is f32::round() used here instead of f32::trunc()?
The same applies to f64 below.

fn main() {
    let factor = 10_f64;
    let r = (3.76_f64 * factor).round() / factor;
    let t = (3.76_f64 * factor).trunc() / factor;
    println!("round: {r}\ntrunc: {t}");
}

prints:

round: 3.8
trunc: 3.7

Yeah, it does seem like a bug. I will file an issue.

Filed #19793.
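
For reference, a trunc-based variant of the helpers discussed in this thread could look like the sketch below. The function names are hypothetical, and this is not necessarily the fix adopted for #19793; it only swaps `round()` for `trunc()` in the quoted pattern.

```rust
/// Sketch: scale by 10^y, drop the fractional part, then scale back.
fn compute_truncate64_sketch(x: f64, y: i64) -> f64 {
    let factor = 10.0_f64.powi(y as i32);
    (x * factor).trunc() / factor
}

/// Same approach for f32.
fn compute_truncate32_sketch(x: f32, y: i64) -> f32 {
    let factor = 10.0_f32.powi(y as i32);
    (x * factor).trunc() / factor
}
```

With this variant, truncating 3.76 to one decimal place gives 3.7 rather than 3.8, and negative inputs truncate toward zero.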

martin-g · 2026-01-13T13:05:11Z

+            };
+
+            match scalar {
+                ScalarValue::Float64(v) => {


Add a fast path for ScalarValue::Null too?

martin-g · 2026-01-13T13:06:00Z

+                ScalarValue::Float64(v) => {
+                    let result = v.map(|x| {
+                        if precision == 0 {
+                            if x == 0.0 { 0.0 } else { x.trunc() }


Suggested change

if x == 0.0 { 0.0 } else { x.trunc() }

x.trunc()

martin-g · 2026-01-13T13:06:11Z

+                ScalarValue::Float32(v) => {
+                    let result = v.map(|x| {
+                        if precision == 0 {
+                            if x == 0.0 { 0.0 } else { x.trunc() }


Suggested change

if x == 0.0 { 0.0 } else { x.trunc() }

x.trunc()
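
One detail worth knowing when comparing the two forms in these suggestions: they compare equal under `==` for every input, but they can differ in the sign bit when the input is exactly -0.0. The sketch below uses hypothetical function names and makes no claim about which behavior the PR intended.

```rust
/// The guarded form from the original diff: an input equal to zero
/// (including -0.0) is mapped to positive zero.
fn trunc_with_zero_guard(x: f64) -> f64 {
    if x == 0.0 { 0.0 } else { x.trunc() }
}

/// The simplified form from the suggestion: the sign of zero is preserved.
fn trunc_plain(x: f64) -> f64 {
    x.trunc()
}
```

Note that the guard only covers inputs that are exactly zero; a value such as -0.5 still truncates to -0.0 in both forms.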

martin-g · 2026-01-13T13:08:14Z

+    )];
+    let scalar_arg_fields = vec![Field::new("a", DataType::Float64, false).into()];
+    let scalar_return_field = Field::new("f", DataType::Float64, false).into();
+    let config_options = Arc::new(ConfigOptions::default());


nit: This variable shadows the same one from line 40

kumarUjjawal · 2026-01-13T15:42:54Z

Thanks for the feedback @martin-g, incorporated the changes.

Jefffrey · 2026-01-14T15:34:51Z

+                return make_scalar_function(trunc, vec![])(&args.args);
+            }
+            None => Some(0), // default precision
+            _ => Some(0),


This catch-all arm should return an internal error, unless there's a case I'm missing?

Yes, it should. Made the changes.
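
The shape of that change can be sketched as follows: the catch-all arm reports an error instead of silently falling back to a precision of 0. `Arg` and `precision_from_arg` are hypothetical, and a plain `String` error stands in for whatever error type the real code returns (a later commit message mentions `exec_err!`).

```rust
/// Hypothetical stand-in for the kinds of argument the function may see.
#[derive(Debug)]
enum Arg {
    Int64(Option<i64>),
    Utf8(String),
}

/// Resolve the optional precision argument.
/// `Ok(None)` means the precision itself was NULL.
fn precision_from_arg(arg: Option<&Arg>) -> Result<Option<i64>, String> {
    match arg {
        // No precision supplied: use the default of 0.
        None => Ok(Some(0)),
        Some(Arg::Int64(p)) => Ok(*p),
        // Anything else is unexpected, so surface it rather than guessing.
        Some(other) => Err(format!("trunc: unexpected precision argument {other:?}")),
    }
}
```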

Co-authored-by: Jeffrey Vo <jeffrey.vo.australia@gmail.com>

Jefffrey · 2026-01-17T03:13:10Z

Thanks @kumarUjjawal & @martin-g

perf: Optimize trunc scalar performance

1888c99

github-actions Bot added the functions Changes to functions implementation label Jan 13, 2026

martin-g reviewed Jan 13, 2026

fix clippy and fix return type

8db0ed7

Jefffrey reviewed Jan 14, 2026

Comment thread datafusion/functions/src/math/trunc.rs Outdated

refactor to flatten nested structure

a2fd8b8

Jefffrey reviewed Jan 14, 2026

kumarUjjawal and others added 2 commits January 14, 2026 22:29

suggestion from Jeffrey

0180eb6

Co-authored-by: Jeffrey Vo <jeffrey.vo.australia@gmail.com>

catch arm returns exec_err! with message

e6e2825

Jefffrey approved these changes Jan 15, 2026

Jefffrey added this pull request to the merge queue Jan 17, 2026

Merged via the queue into apache:main with commit 3ea21aa Jan 17, 2026
28 checks passed
