Skip to content

Conversation

@justinvyu
Copy link
Contributor

@justinvyu justinvyu commented Apr 24, 2025

Summary

Fixes the bug mentioned in #52513.
The S3 filesystem serialization wrapper doesn't preserve the retryable_erros and wrap_retries properties during serialization (the __reduce__ didn't include those properties to reconstruct it properly). This PR leaves the serialization logic to the RetryablePyFileSystem rather than have the buggy wrapping/unwrapping.

This S3 serialization wrapper cannot be removed still because there's a deserialization issue that still exists in pyarrow<14.0.0.

This workaround can be removed once the lower bound pyarrow version exceeds 14.0.0.

Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: Justin Yu <justinvyu@anyscale.com>
@justinvyu justinvyu enabled auto-merge (squash) April 24, 2025 23:05
@github-actions github-actions bot added the go add ONLY when ready to merge, run all tests label Apr 24, 2025
@justinvyu justinvyu merged commit 8188632 into ray-project:master Apr 25, 2025
7 checks passed
@justinvyu justinvyu deleted the fix_s3_wrapper branch April 25, 2025 00:21
iamjustinhsu pushed a commit that referenced this pull request May 3, 2025
…eSystem` (#52568)

Fixes the bug mentioned in
#52513.
The S3 filesystem serialization wrapper doesn't preserve the
`retryable_erros` and `wrap_retries` properties during serialization
(the `__reduce__` didn't include those properties to reconstruct it
properly). This PR leaves the serialization logic to the
`RetryablePyFileSystem` rather than have the buggy wrapping/unwrapping.

This S3 serialization wrapper cannot be removed still because there's a
[deserialization issue](apache/arrow#38375) that
still exists in pyarrow<14.0.0.

This workaround can be removed once the lower bound pyarrow version
exceeds 14.0.0.

---------

Signed-off-by: Justin Yu <justinvyu@anyscale.com>
Signed-off-by: jhsu <jhsu@anyscale.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

community-backlog go add ONLY when ready to merge, run all tests

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants