Skip to content

Conversation

@alamb
Copy link
Contributor

@alamb alamb commented Jan 30, 2024

Which issue does this PR close?

Closes #9074
Part of #8045 (making functions modular in DataFusion)

Rationale for this change

See #9074 (basically set us up for pulling aggregates and window functions out of the core)

What changes are included in this PR?

  1. Add FunctionRegistry::register_udaf and FunctionRegistry::register_udwf
  2. Implement them for SessionState
  3. Update SessionContext to use that API rather than modifying session state directly

Are these changes tested?

Covered by existing tests

Are there any user-facing changes?

@github-actions github-actions bot added logical-expr Logical plan and expressions physical-expr Changes to the physical-expr crates core Core DataFusion crate labels Jan 30, 2024
@alamb alamb force-pushed the alamb/more_functions branch from c0d88f7 to 7eeb52e Compare January 31, 2024 22:47
@github-actions github-actions bot removed logical-expr Logical plan and expressions physical-expr Changes to the physical-expr crates labels Jan 31, 2024
state
.scalar_functions
.insert(f.name().to_string(), Arc::new(f));
state.register_udf(Arc::new(f)).ok();
Copy link
Contributor Author

@alamb alamb Jan 31, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

moved into SessionState -- it is important that the alias resolution is done there so when new functions are registered via SessionState their aliases are as well. Otherwise aliases are only added when the function is defined via SessionContext

///
/// Returns an error (the default) if the function can not be registered,
/// for example if the registry is read only.
fn register_udaf(
Copy link
Contributor Author

@alamb alamb Jan 31, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is the key (backwards compatible) API addition in this PR.

It sets us up for being able to pull out BuiltInAggregate and BuiltInWindowFunction

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm wondering why this is a backwards compatible API? Does FunctionRegistry provide this API before?

Copy link
Contributor Author

@alamb alamb Feb 6, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it is backwards compatible because it provides a default implementation (to return a Not Yet Implemented error)

Thus, any existing implementations of FunctionRegistry will continue to compile and work as it did before, without any required code changes

@alamb
Copy link
Contributor Author

alamb commented Feb 6, 2024

I wonder if you might have time to review this PR @comphead ?

Once it is merged, I can then start breaking out more functions (e.g. #9113)

Copy link
Contributor

@comphead comphead left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm as for me @alamb
Maybe we can do a one liner in register_udf

let aliases = udf.aliases();
for alias in aliases {
self.scalar_functions.insert(alias.to_string(), udf.clone());
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
}

alamb and others added 3 commits February 6, 2024 08:01
Co-authored-by: comphead <comphead@users.noreply.github.com>
Co-authored-by: Liang-Chi Hsieh <viirya@gmail.com>
@alamb alamb merged commit 77f9775 into apache:main Feb 7, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

core Core DataFusion crate

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add FunctionRegistry::register_udaf and FunctionRegistry::register_udwf

3 participants