GH-20127: [Python] Remove deprecated pyarrow.filesystem legacy implementations #39825
jorisvandenbossche
left a comment
Looking good!
I assume there will be some declarations in libarrow.pxd that were used in _hdfsio.pyx and now are no longer used, and can be removed.
b479920 to e005766
Is this ready for review, @AlenkaF?
I still need to fix one test that was failing. I added a try-except (see #39825 (comment)), but I need to change the approach. I plan to work on it tomorrow!
285d909 to 142af85
Ready for review @jorisvandenbossche @pitrou
pitrou
left a comment
Thanks a lot, @AlenkaF. Here are a couple of comments, but this LGTM in principle.
Co-authored-by: Antoine Pitrou <pitrou@free.fr>
jorisvandenbossche
left a comment
Looks good to me now! Just a few small nits
Co-authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
The one failure on Windows is #40337
After merging your PR, Conbench analyzed the 7 benchmarking runs that have been run so far on merge-commit 2b194ad. There were no benchmark performance regressions. 🎉 The full Conbench report has more details. It also includes information about 1 possible false positive for unstable benchmarks that are known to sometimes produce them.
…sis setup (#40363)
### Rationale for this change
Small follow-up on #39825, which removed the `test_hdfs.py` file itself, but didn't remove it from the hypothesis script.
* GitHub Issue: #20127
Authored-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
Signed-off-by: Joris Van den Bossche <jorisvandenbossche@gmail.com>
The current implementation of EnhancedFileSystem is based on the legacy pyarrow filesystem interface that was removed in pyarrow 16.0.0 (apache/arrow#39825). We can replace EnhancedFileSystem entirely with fsspec. For HDFS, fsspec relies on the new pyarrow filesystem interface.
Behavior change note: for put, fsspec doesn't preserve file permissions.
Resolves #87
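A minimal sketch of what the fsspec-based replacement could look like, assuming fsspec is installed. The helper name `put_and_read` is hypothetical, and the local `"file"` backend stands in for HDFS so the example runs without a cluster; it only illustrates the `put` call mentioned above and is not the downstream project's actual code.

```python
import os
import tempfile


def put_and_read(payload: bytes) -> bytes:
    """Copy a file with fsspec's put() and return the copied bytes."""
    # Imported inside the function so this sketch loads even without fsspec.
    import fsspec

    # "file" is fsspec's local backend, used here as a stand-in for HDFS.
    filesystem = fsspec.filesystem("file")
    with tempfile.TemporaryDirectory() as tmp:
        src = os.path.join(tmp, "src.bin")
        dst = os.path.join(tmp, "dst.bin")
        with open(src, "wb") as handle:
            handle.write(payload)
        # put() uploads a local file to the target filesystem.
        filesystem.put(src, dst)
        return filesystem.cat_file(dst)
```

Per the behavior change note, callers that depend on the file mode surviving `put` would presumably need to set it themselves after the copy.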
This PR removes the `pyarrow.filesystem` and `pyarrow.hdfs` filesystems that have been deprecated since 2.0.0.
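For code that still uses the removed modules, a rough sketch of the same kind of operation on the newer `pyarrow.fs` interface might look like the following. The helper name `roundtrip_bytes` is hypothetical, and `LocalFileSystem` is used only so the example runs without a Hadoop cluster.

```python
import os
import tempfile


def roundtrip_bytes(payload: bytes) -> bytes:
    """Write payload to a temp file and read it back through pyarrow.fs."""
    # Imported inside the function so this sketch loads even without pyarrow.
    from pyarrow import fs

    # With a cluster available, fs.HadoopFileSystem(host, port) exposes
    # the same methods as the local filesystem used here.
    filesystem = fs.LocalFileSystem()
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "example.bin")
        with filesystem.open_output_stream(path) as sink:
            sink.write(payload)
        # get_file_info() reports whether the path is a file or directory.
        info = filesystem.get_file_info(path)
        assert info.type == fs.FileType.File
        with filesystem.open_input_stream(path) as source:
            return source.read()
```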