Skip to content

[C++][Dataset] Separate datasets backed by readers from InMemoryDataset #28047

@asfimport

Description

@asfimport

From ARROW-10882/#9802 

  • Backing an InMemoryDataset with a reader is misleading. Let's split that out into a separate class.
  • Dataset scanning can then use an I/O thread for the new class. (Note that for Python, we'll need to be careful to release the GIL before any operations so that the I/O thread can acquire the GIL to call into the underlying Python reader/file object.)
  • Longer-term, we should interface with Python's async.

Reporter: Weston Pace / @westonpace
Assignee: David Li / @lidavidm

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-12231. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions