Skip to content

Configure statistics_truncate_length in Parquet writer #14601

@patchwork01

Description

@patchwork01

Is your feature request related to a problem or challenge?

DataFusion has deprecated the configuration option datafusion.execution.parquet.max_statistics_size, because it's not used:

It seems to have been replaced in the Parquet library by statistics_truncate_length, added here:

There doesn't seem to be a way to set this in DataFusion.

Describe the solution you'd like

We'd like a configuration option for statistics_truncate_length in ParquetOptions, to be applied to the Parquet WriterProperties similarly to column_index_truncate_length.

Describe alternatives you've considered

No response

Additional context

This causes the following issue:

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions