When the plan contains a FilterExec, the filter can produce many small batches, and we lose the efficiency of vectorized operations.
We should implement a new CoalesceBatchExec operator and wrap every FilterExec in one, so that small batches are recombined into larger batches, improving the efficiency of the operators that consume the filter's output.
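
To make the proposal concrete, here is a minimal sketch of the coalescing step, not the actual operator. It assumes a batch can be modelled as a plain `Vec<i64>` (a single column) instead of an Arrow `RecordBatch`. The names `Batch`, `coalesce_batches` and `target_batch_size` are hypothetical and chosen only for illustration. The policy shown (buffer until the target is reached, never split large batches) is one possible choice.

```rust
/// Hypothetical stand-in for a record batch: a single column of values.
type Batch = Vec<i64>;

/// Recombine small batches so that every emitted batch, except possibly the
/// last one, holds at least `target_batch_size` rows. Row order is preserved.
fn coalesce_batches<I>(input: I, target_batch_size: usize) -> Vec<Batch>
where
    I: IntoIterator<Item = Batch>,
{
    let mut output = Vec::new();
    let mut buffer: Batch = Vec::new();

    for batch in input {
        // A filter may remove every row of a batch; skip empty results.
        if batch.is_empty() {
            continue;
        }
        // A batch that is already large enough passes through without a
        // copy, as long as no earlier rows are waiting in the buffer.
        if buffer.is_empty() && batch.len() >= target_batch_size {
            output.push(batch);
            continue;
        }
        // Otherwise accumulate rows until the target size is reached.
        buffer.extend(batch);
        if buffer.len() >= target_batch_size {
            // Emit the buffered rows and leave an empty buffer behind.
            output.push(std::mem::take(&mut buffer));
        }
    }

    // Flush whatever is left once the input is exhausted.
    if !buffer.is_empty() {
        output.push(buffer);
    }
    output
}
```

In this sketch an emitted batch can exceed the target size, because buffered rows are never split. A real operator would concatenate Arrow arrays column by column and could make a different trade-off.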
Reporter: Andy Grove / @andygrove
Assignee: Andy Grove / @andygrove
PRs and other links:
Note: This issue was originally created as ARROW-11058. Please see the migration documentation for further details.