Closed as not planned
Labels
Component: C++, Component: Parquet, Status: stale-warning, Type: enhancement
Description
A recent parquet-format release added support for a Page Index, which makes it possible to skip pages within a RowGroup.
This should be implemented by Apache Arrow.
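For illustration only, here is a minimal sketch of how per-page min/max statistics allow a reader to skip pages. It is not Arrow's or parquet-cpp's implementation: `PageEntry` and `SelectPages` are hypothetical names, the column is assumed to hold 64-bit integers, and the entry is a simplified stand-in for what the format stores in ColumnIndex and OffsetIndex.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical, simplified stand-in for one page's entry in a page index.
// In the Parquet format the min/max statistics live in ColumnIndex and the
// page locations in OffsetIndex; this struct is NOT the Arrow API.
struct PageEntry {
  int64_t min_value;        // smallest value stored in the page
  int64_t max_value;        // largest value stored in the page
  int64_t first_row_index;  // index of the page's first row in the row group
};

// Returns the positions of the pages whose [min_value, max_value] range
// overlaps the closed query range [lo, hi]. Every other page cannot contain
// a matching value, so a reader could skip it without decoding it.
std::vector<std::size_t> SelectPages(const std::vector<PageEntry>& pages,
                                     int64_t lo, int64_t hi) {
  std::vector<std::size_t> selected;
  for (std::size_t i = 0; i < pages.size(); ++i) {
    const PageEntry& page = pages[i];
    if (page.max_value >= lo && page.min_value <= hi) {
      selected.push_back(i);
    }
  }
  return selected;
}
```

With three pages covering the value ranges 0 to 9, 10 to 19, and 20 to 29, a query for values between 12 and 21 selects only the second and third pages, so the first page never has to be read.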
Reporter: Malthe Borch
Assignee: Gang Wu / @wgtmac
Subtasks:
- [C++][Parquet] FileMetaData exposes page index metadata
- [C++][Parquet] Introduce ColumnIndex and OffsetIndex
- [C++][Parquet] Parquet page index read support
- Support predicate pushdown for page index in Parquet dataset scanner
- Enable writing page index by default
Related issues:
- [C++] Add index pages to the format to support efficient page skipping to parquet-cpp (is duplicated by)
Note: This issue was originally created as ARROW-10158. Please see the migration documentation for further details.