Skip to content

JIT-Compile DataFusion Expressions to create RecordBatches #2122

@Dandandan

Description

@Dandandan

Is your feature request related to a problem or challenge? Please describe what you are trying to do.
We should be able to compile.

The benefit of this is that we can speed up complex / nested expressions by avoiding unnecesarry allocations

Describe the solution you'd like

We should be able to take in a collection of a RecordBatch / named Arrays and compile an expression like (a + b)/ 2 to a loop that results in a new Array.

fn compile(schema: SchemaRef, expr: Expr) ->  CompiledFunction {

}

The loop itself also must be included in the to-be compiled expression, to remove call overhead and allow for possible use of SIMD instructionsinstructions, either explicitly by instrumenting cranelift enough or through auto-vectorization.

Describe alternatives you've considered
n/a

Additional context

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions