Skip to content

[Python] Memory leak in pyarrow functions .ipc.serialize_pandas/deserialize_pandas #24175

@asfimport

Description

@asfimport

Used python version is 3.7.4 (conda distribution)

OS: Ubunty 18.04

pandas version is 0.24.2

numpy version is 1.16.4

 

To reproduce the issue run the attached script pyarrow_mem_leak_test.py. Also put the attached file loans.parquet to the folder of working directory.

 

Also the reading and writing to parquet in memory do has memory leaks. To reproduce this run function test_parquet_leak() from the attached file pyarrow_mem_leak_test.py

The memory leak is 100% reproducible.

Reporter: Denis
Assignee: Wes McKinney / @wesm

Related issues:

Original Issue Attachments:

Note: This issue was originally created as ARROW-7956. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions