
[C++][Parquet] Getting only 0 when reading DELTA_BINARY_PACKED #15052

@Anieway

Describe the bug, including details regarding any error messages, version, and platform.

I have encountered a possible bug: when reading values one at a time with the low-level API of the Parquet reader
(similar to the example in /cpp/examples/parquet/low_level_api/reader_writer.cc)
from a column with DELTA_BINARY_PACKED encoding, the results are all 0 regardless of the file content.

That is, this call yields a value of 0:
rows_read = int64_reader->ReadBatch(1, &definition_level, &repetition_level, &value, &values_read);
whereas this call yields the correct values:
rows_read = int64_reader->ReadBatch(2, nullptr, nullptr, values, &values_read);

So the problem does not seem to occur when reading with larger batch sizes (>1).

The problem might be somewhere within DeltaBitPackDecoder<DType>::GetInternal.
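
To make the expected behaviour concrete: a delta decoder should return the same values whether it is drained one value at a time or in larger batches. Below is a minimal, self-contained sketch of that invariant. Note that ToyDeltaDecoder and DecodeAll are hypothetical names made up for illustration, and the decoder is heavily simplified (a first value plus plain deltas, with no blocks, miniblocks, or bit packing), so it is not Arrow's DeltaBitPackDecoder and says nothing about where the actual bug is.

```cpp
#include <cstddef>
#include <cstdint>
#include <utility>
#include <vector>

// Hypothetical toy decoder for illustration only; NOT Arrow's implementation.
// It models the core idea of delta encoding: a first value followed by deltas.
class ToyDeltaDecoder {
 public:
  ToyDeltaDecoder(int64_t first_value, std::vector<int64_t> deltas)
      : last_value_(first_value), deltas_(std::move(deltas)) {}

  // Decodes up to max_values values into out and returns how many were written.
  int Get(int64_t* out, int max_values) {
    int decoded = 0;
    while (decoded < max_values && HasNext()) {
      if (!first_emitted_) {
        // The first value is stored directly, not as a delta.
        first_emitted_ = true;
      } else {
        last_value_ += deltas_[next_delta_++];
      }
      out[decoded++] = last_value_;
    }
    return decoded;
  }

 private:
  bool HasNext() const {
    return !first_emitted_ || next_delta_ < deltas_.size();
  }

  int64_t last_value_;
  std::vector<int64_t> deltas_;
  std::size_t next_delta_ = 0;
  bool first_emitted_ = false;
};

// Drains a copy of the decoder using the given batch size (must be > 0).
std::vector<int64_t> DecodeAll(ToyDeltaDecoder decoder, int batch_size) {
  std::vector<int64_t> result;
  std::vector<int64_t> buffer(static_cast<std::size_t>(batch_size));
  int n = 0;
  while ((n = decoder.Get(buffer.data(), batch_size)) > 0) {
    result.insert(result.end(), buffer.begin(), buffer.begin() + n);
  }
  return result;
}
```

With this sketch, DecodeAll(decoder, 1) and DecodeAll(decoder, 2) return identical vectors for the same input. That is the property the ReadBatch(1, ...) call above appears to violate for DELTA_BINARY_PACKED columns.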

Component(s)

Parquet
