Read out only the required columns from a Feather file on Disk

The LZ4 compressed feather format works really well and is quite nice and fast than Parquet when reading all the columns out. For single columns, looks like feather format does not yet support the capability to read out only the required columns from the disk. Are there any plans to add support for this? Here are some numbers to support my claim from an experiment with a table with 17 columns and 5 million rows in uncompressed parquet and LZ4 compressed feather format. No memory mapping involved.

 
pq_all_cols: 0.4179724836349487 ms
feather_all_cols: 0.26202451705932617 ms
pq_single_col: 0.10951032638549804 ms
feather_single_col: 0.2119576358795166 ms

**Reporter**: [Jayjeet Chakraborty](https://issues.apache.org/jira/browse/ARROW-13126) / @JayjeetAtGithub
#### Related issues:
- [[C++] Enable fine-grained I/O (coalescing) in IPC reader](https://github.com/apache/arrow/issues/28430) (relates to)

<sub>**Note**: *This issue was originally created as [ARROW-13126](https://issues.apache.org/jira/browse/ARROW-13126). Please see the [migration documentation](https://github.com/apache/arrow/issues/14542) for further details.*</sub>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Read out only the required columns from a Feather file on Disk #28827

Related issues:

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Read out only the required columns from a Feather file on Disk #28827

Description

Related issues:

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions