Open
Description
FileReaderImpl::ReadRowGroup fails with "Nested data conversions not implemented for chunked array outputs". The failure occurs in ChunksToSingle.
The data schema is:
optional group fields_map (MAP) = 217 {
  repeated group key_value {
    required binary key (STRING) = 218;
    optional binary value (STRING) = 219;
  }
}
fields_map.key_value.value-> Size In Bytes: 13243589 Size In Ratio: 0.20541047
fields_map.key_value.key-> Size In Bytes: 3008860 Size In Ratio: 0.046667963
Is there a way to work around this issue in the C++ library?
In any case, I am willing to implement this, but I need some guidance. I am very new to Parquet (as in, I started reading about it yesterday).
Probably related to: https://issues.apache.org/jira/browse/ARROW-10958
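For context, here is a minimal sketch of why a reader might end up with a chunked output in the first place. It assumes (this is not confirmed by the report above) that the string column is decoded into arrays with 32-bit offsets, so a single array can address only about 2 GiB of value bytes and a new chunk is started whenever the next value would not fit. CountChunks and its default limit are hypothetical names for illustration and are not part of the Arrow or Parquet C++ API.

```cpp
#include <cstdint>
#include <limits>
#include <vector>

// Hypothetical helper, not part of Arrow: counts how many chunks are needed
// to hold values with the given byte sizes when one chunk can address at
// most `limit` bytes. The default limit models 32-bit offsets (an assumption).
// Assumes every individual value fits within `limit`.
std::int64_t CountChunks(
    const std::vector<std::int64_t>& value_sizes,
    std::int64_t limit = std::numeric_limits<std::int32_t>::max()) {
  std::int64_t chunks = 0;  // number of chunks started so far
  std::int64_t used = 0;    // bytes stored in the current chunk
  bool open = false;        // whether a chunk has been started yet
  for (std::int64_t size : value_sizes) {
    // Start a new chunk for the first value, or when the next value
    // would push the current chunk past the addressable limit.
    if (!open || used + size > limit) {
      ++chunks;
      used = 0;
      open = true;
    }
    used += size;
  }
  return chunks;
}
```

Under this assumption, any row group for which such a count exceeds one would yield a chunked array for that column, which is the case the error message says is not implemented for nested types such as the MAP above.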
Reporter: Arthur Passos
Assignee: Arthur Passos
Related issues:
- [Python] read_row_group fails with Nested data conversions not implemented for chunked array outputs (duplicates)
Note: This issue was originally created as ARROW-17459. Please see the migration documentation for further details.