Skip to content

[Rust] [Datafusion] Optionally load tpch data into memory before running benchmark query #26238

@asfimport

Description

@asfimport

The tpch benchmark runtime seems to be dominated by csv parsing code and it is really difficult to see any performance hotspots related to actual query execution in a flamegraph.

With the date in memory and more iterations it should be easier to profile and find bottlenecks.

Reporter: Jörn Horstmann / @jhorstmann
Assignee: Jörn Horstmann / @jhorstmann

PRs and other links:

Note: This issue was originally created as ARROW-10240. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

No one assigned

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions