Skip to content

[C++][Parquet] Build logical schema tree mapping Arrow fields to Parquet schema levels #22477

@asfimport

Description

@asfimport

In several places in cpp/src/parquet/arrow, the FromParquetSchema function is used to construct fields using a filtered "view" of the Parquet schema. This is a hack caused by the lack of some kind of a "schema tree" which maps Parquet concepts to Arrow Field objects.

One manifestation of this issue is that I was unable to implement dictionary encoded subfields in cases like list<string>, where you want the inner field to be dictionary-encoded.

Patch forthcoming

Reporter: Wes McKinney / @wesm
Assignee: Wes McKinney / @wesm

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-6077. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions