Skip to content

[Rust] [DataFusion] MemTable::load() should load partitions in parallel #26248

@asfimport

Description

@asfimport

MemTable::load() should load partitions in parallel using async tasks, rather than loading one partition at a time.

Also, we should make batch size configurable. It is currently hard-coded to 1024*1024 which can be quite inefficient.

Reporter: Andy Grove / @andygrove
Assignee: Andy Grove / @andygrove

PRs and other links:

Note: This issue was originally created as ARROW-10251. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions