Skip to content

Conversation

@wesm
Copy link
Member

@wesm wesm commented Mar 6, 2020

Also adds a unit test for ARROW-7956 that fails in 0.15.0 and 0.15.1 but passes in 0.16.0.

Since these tests are time consuming I set them off by default, but we need to run them regularly. I opened ARROW-8048 about setting up a nightly build to run these.

@github-actions
Copy link

github-actions bot commented Mar 6, 2020

@wesm
Copy link
Member Author

wesm commented Mar 6, 2020

@wesm
Copy link
Member Author

wesm commented Mar 9, 2020

+1. I'll open a JIRA about running these tests nightly (along with ARROW-4046 for the "large memory" tests, seems like these go together)

@wesm
Copy link
Member Author

wesm commented Mar 9, 2020

The code in https://github.com/dask/distributed/blob/master/distributed/pytest_resourceleaks.py seems superior to what I implemented, but what is here will enable us to begin writing memory leak tests, and later we can improve the "harness" around these tests

@wesm wesm closed this in 2a0da6d Mar 9, 2020
@wesm wesm deleted the ARROW-4120 branch March 9, 2020 22:52
@pytest.mark.pandas
def test_deserialize_pandas_arrow_7956():
df = pd.DataFrame({'a': np.arange(10000),
'b': [pd.util.testing.rands(5) for _ in range(10000)]})
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Future note: we should not use pandas.util.testing, as that is deprecated (my bad for using it in the reproducer example)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah thanks for letting me know. Looks like we have a couple lingering usages

python/pyarrow/tests/test_adhoc_memory_leak.py
35:                       'b': [pd.util.testing.rands(5) for _ in range(10000)]})

python/pyarrow/tests/test_plasma.py
411:        pd.util.testing.assert_frame_equal(df, result)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, I already cleaned up most some time ago, but clearly missed the plasma one.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants