-
Notifications
You must be signed in to change notification settings - Fork 4k
Closed
Labels
Component: C++Component: ParquetCritical FixBugfixes for security vulnerabilities, crashes, or invalid data.Bugfixes for security vulnerabilities, crashes, or invalid data.Type: bug
Milestone
Description
Describe the bug, including details regarding any error messages, version, and platform.
I have encountered a possible bug where when reading values one at a time with the low level API of parquet reader
(as done similarly in the example: /cpp/examples/parquet/low_level_api/reader_writer.cc)
in a row with DELTA_BINARY_PACKED encoding the results are all 0 regardless of file content.
aka this rows_read = int64_reader->ReadBatch(1, &definition_level, &repetition_level, &value, &values_read); gives a zero value
but this rows_read = int64_reader->ReadBatch(2, nullptr, nullptr, values, &values_read); gives the correct value.
So this seems not to occur when reading bigger batch-sizes (>1).
The problem might be somewhere within DeltaBitPackDecoder<DType>::GetInternal
Component(s)
Parquet
Metadata
Metadata
Assignees
Labels
Component: C++Component: ParquetCritical FixBugfixes for security vulnerabilities, crashes, or invalid data.Bugfixes for security vulnerabilities, crashes, or invalid data.Type: bug