Skip to content

[Rust] [DataFusion] Improve API for usage of UDFs #25875

@asfimport

Description

@asfimport

TL;DR; currently, users call UDFs through
 
df.select(scalar_functions(“sqrt”, vec![col(“a”)], DataType::Float64))
 
Proposal:
 
let f = df.registry();

df.select(f.udf(“sqrt”, vec![col(“a”)])?)
 
so that they do not have to remember the UDFs return type when using it.
 
This API will in the future allow to declare the UDF as part of the planning, like spark, instead of having to register it in the registry before using it (we just need to check if the UDF is registered or not before doing so).
See complete proposal here: https://docs.google.com/document/d/1Kzz642ScizeKXmVE1bBlbLvR663BKQaGqVIyy9cAscY/edit?usp=sharing

 

Reporter: Jorge Leitão / @jorgecarleitao
Assignee: Jorge Leitão / @jorgecarleitao

Related issues:

PRs and other links:

Note: This issue was originally created as ARROW-9836. Please see the migration documentation for further details.

Metadata

Metadata

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions