-
Notifications
You must be signed in to change notification settings - Fork 4k
Open
Description
I proposed this ages ago in https://lists.apache.org/thread/xpn9gyrs6kqc3g9t8k4ts8dmy7yyxskq and am finally getting around to implementing it.
I propose to add ordering information to exec nodes (mostly for node validation) and indices to exec batches. This is a fundamental step to allow nodes to consume this ordering information to achieve features such as ARROW-10883 and ARROW-16628. It also can replace the complicated batch enumeration in the current scanner to support in-order table reassembly.
Reporter: Weston Pace / @westonpace
Assignee: Weston Pace / @westonpace
Watchers: Rok Mihevc / @rok
Related issues:
- [C++][Dataset] Preserve order when writing dataset (is required by)
PRs and other links:
Note: This issue was originally created as ARROW-17762. Please see the migration documentation for further details.