-
Notifications
You must be signed in to change notification settings - Fork 4k
Closed
Milestone
Description
Currently, our aggregations are made in a simple loop. However, as described here, horizontal operations can also be SIMDed, reports of 2.7x speedups.
The goal of this improvement is to support SIMD for the "sum", for primitive types.
The code to modify is in here. A good indication that this issue is completed is when the script
cargo bench --bench aggregate_kernels && cargo bench --bench aggregate_kernels --features simd
yields a speed-up.
Reporter: Jorge Leitão / @jorgecarleitao
Assignee: Jörn Horstmann / @jhorstmann
PRs and other links:
Note: This issue was originally created as ARROW-10015. Please see the migration documentation for further details.