[C++][Dataset] Optionally encode partition field values as dictionary type

In the Python ParquetDataset implementation, the partition fields are returned as dictionary type columns. 

The new Dataset API uses a plain type instead (integer or string, when inferred). You can already request dictionary type for the partition keys manually by specifying the partitioning schema (in the `Partitioning` passed to the dataset factory).

Because dictionary type can be more efficient (partition keys are typically repeated values in the resulting table), it might still be good to have an option in the DatasetFactory to use dictionary types for the partition fields.
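To illustrate why repeated partition keys are a good fit for dictionary type, here is a minimal pure-Python sketch of dictionary encoding. It is not Arrow's implementation; the helper `dictionary_encode` and the `year` column are hypothetical names made up for this illustration.

```python
from typing import Dict, Hashable, List, Sequence, Tuple


def dictionary_encode(values: Sequence[Hashable]) -> Tuple[List[Hashable], List[int]]:
    """Return (dictionary, indices) with dictionary[indices[i]] == values[i]."""
    dictionary: List[Hashable] = []          # each distinct value, stored once
    positions: Dict[Hashable, int] = {}      # value -> its index in `dictionary`
    indices: List[int] = []                  # one small integer per row
    for value in values:
        if value not in positions:
            positions[value] = len(dictionary)
            dictionary.append(value)
        indices.append(positions[value])
    return dictionary, indices


# Hypothetical partition column: every row read from a year=2019/ directory
# carries the same key, and likewise for year=2020/.
year_column = ["2019"] * 4 + ["2020"] * 3
dictionary, indices = dictionary_encode(year_column)
print(dictionary)  # ['2019', '2020']
print(indices)     # [0, 0, 0, 0, 1, 1, 1]
```

Each distinct key is stored once and every row holds only an integer index, so a column with few distinct values and many rows stays compact.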

See also https://github.com/apache/arrow/pull/6303#discussion_r400622340

**Reporter**: [Joris Van den Bossche](https://issues.apache.org/jira/browse/ARROW-8647) / @jorisvandenbossche
**Assignee**: [Ben Kietzman](https://issues.apache.org/jira/browse/ARROW-8647) / @bkietz
#### Related issues:
- [[C++][Dataset] Discovery of partition field as dictionary type segfaulting with HivePartitioning](https://github.com/apache/arrow/issues/25380) (relates to)
#### PRs and other links:
- [GitHub Pull Request #7536](https://github.com/apache/arrow/pull/7536)

<sub>**Note**: *This issue was originally created as [ARROW-8647](https://issues.apache.org/jira/browse/ARROW-8647). Please see the [migration documentation](https://github.com/apache/arrow/issues/14542) for further details.*</sub>
