[Python] Read and write pandas.DataFrame in pyarrow.serialize by decomposing the BlockManager rather than coercing to Arrow format #17782

@asfimport

See discussion in dask/distributed#931

This will permit zero-copy reads for DataFrames that do not contain Python objects. If an ObjectBlock is present, its arrays should be no worse than pickle to reconstruct on the receiving side.
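
As a rough illustration of the idea, here is a toy sketch using only the Python standard library. It does not use pandas, pyarrow, or the real BlockManager: `serialize_frame` and `deserialize_frame` are hypothetical helper names, an `array.array` column stands in for a numeric block, and a plain list stands in for an ObjectBlock. In pandas a block would typically hold several same-dtype columns; here each column is treated as its own block to keep the sketch short.

```python
import pickle
from array import array


def serialize_frame(columns):
    """Split a dict of columns into (metadata, buffers).

    Numeric columns (array.array) expose their raw buffer untouched;
    anything else is treated as an "object" column and pickled.
    """
    meta, buffers = [], []
    for name, values in columns.items():
        if isinstance(values, array):
            meta.append((name, "raw", values.typecode))
            buffers.append(memoryview(values))  # a view, not a copy
        else:
            meta.append((name, "pickle", None))
            buffers.append(pickle.dumps(list(values)))
    return meta, buffers


def deserialize_frame(meta, buffers):
    """Rebuild the columns from metadata and buffers."""
    columns = {}
    for (name, kind, typecode), buf in zip(meta, buffers):
        if kind == "raw":
            # Reinterpret the bytes in place; no per-element conversion.
            columns[name] = memoryview(buf).cast("B").cast(typecode)
        else:
            # Object data costs exactly what pickle costs, no more.
            columns[name] = pickle.loads(buf)
    return columns


# Hypothetical example data: two numeric columns and one object column.
frame = {
    "a": array("d", [1.0, 2.0, 3.0]),
    "b": array("q", [10, 20, 30]),
    "c": ["x", None, {"k": 1}],
}
meta, buffers = serialize_frame(frame)
restored = deserialize_frame(meta, buffers)
```

In this sketch the numeric columns in `restored` are views over the same memory as the originals, which is the property the proposal aims for, while the object column round-trips through pickle.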

Reporter: Wes McKinney / @wesm
Assignee: Wes McKinney / @wesm

Note: This issue was originally created as ARROW-1784. Please see the migration documentation for further details.
