-
Notifications
You must be signed in to change notification settings - Fork 4k
Description
The place internally where the "legacy" pyarrow.filesystem filesystems are still used is in the pyarrow.parquet module.
It is used in:
-
ParquetWriter
-
ParquetManifest/ParquetDataset
-
write_to_dataset
For
ParquetWriter, we need to update this to work with the new filesystems (since ParquetWriter is not dataset related, and thus won't be deprecated).
ForParquetManifest/ParquetDataset, it might not need to be updated, since those might get deprecated itself (to be discussed -> ARROW-9720), and when using theuse_legacy_dataset=Falseoption, it already uses the new datasets.
Forwrite_to_dataset, this might depend on how the writing capabilities of the dataset project evolve.
Reporter: Joris Van den Bossche / @jorisvandenbossche
Assignee: Joris Van den Bossche / @jorisvandenbossche
PRs and other links:
Note: This issue was originally created as ARROW-9718. Please see the migration documentation for further details.