Conversation


@ogrisel ogrisel commented Dec 18, 2017

This is a work-in-progress integration of an experimental cloudpickle branch that allows no-copy dump and load of nested numpy arrays via a bytelist API.

This relies on: cloudpipe/cloudpickle#138

In particular, this should help make the workers more stable when dealing with large numpy arrays or pandas data frames: spilling to disk (and loading spilled data structures back) should no longer incur large temporary buffer allocations.

There are still broken tests (for instance, I just noticed that pickling arrays of objects is broken), and the `_BytelistFile` helper class is not properly tested, but I wanted to do a full run on CI and communicate the final goal of my work on cloudpickle to the other cloudpickle and dask developers.

@mrocklin

I'm glad to see this work happen. Hopefully the Dask test suite can provide some useful feedback!


ogrisel commented Dec 18, 2017

I have to work on other things in the coming days but plan to resume work ASAP.


ogrisel commented Mar 18, 2019

Closing this, as cloudpipe/cloudpickle#138 was closed in favor of PEP 574 (pickle protocol 5 with out-of-band buffers) in upstream Python and numpy.
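For reference, a minimal sketch of the PEP 574 mechanism that superseded this branch: pickle protocol 5 lets large buffers be handed to a `buffer_callback` out-of-band instead of being copied into the pickle byte stream, which is what avoids the large temporary allocations this PR targeted. This assumes Python >= 3.8 and a numpy version with protocol-5 support; it illustrates the standard-library API, not the closed bytelist branch.

```python
import pickle
import numpy as np

# A large array whose data we want to move without copying it
# into the pickle byte stream.
arr = np.arange(1_000_000, dtype=np.float64)

buffers = []
# With protocol 5 (PEP 574), the array's data is passed to
# buffer_callback as PickleBuffer objects instead of being
# serialized inline.
payload = pickle.dumps(arr, protocol=5, buffer_callback=buffers.append)

# The payload is tiny (just metadata); the actual data lives in
# `buffers` and can be written to disk or a socket separately.
restored = pickle.loads(payload, buffers=buffers)
assert np.array_equal(arr, restored)
```

A distributed worker spilling to disk can write `payload` and each buffer to files directly, then reassemble them on load without an intermediate in-memory copy of the whole stream.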

@ogrisel ogrisel closed this Mar 18, 2019
@ogrisel ogrisel deleted the cloudpickle_dump_load_bytelist branch March 18, 2019 09:21