@calcaura calcaura commented Feb 12, 2026

Which issue does this PR close?

Rationale for this change

  • Implement the Spark function "regexp_extract" in DataFusion.
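
For readers unfamiliar with the function, the following is a minimal Python sketch of the behaviour Spark documents for regexp_extract(str, regexp[, idx]). It is an illustration only, not the Rust code in this PR. It assumes Spark's documented semantics: idx defaults to 1, group 0 is the whole match, an empty string is returned when the pattern or the requested group does not match, NULL input yields NULL, and an out-of-range group index is an error. Python's `re` stands in for the regex engine here, so pattern syntax may differ in places from both Java's and Rust's regex flavours.

```python
import re
from typing import Optional


def regexp_extract(text: Optional[str], pattern: str, idx: int = 1) -> Optional[str]:
    """Illustrative model of Spark's regexp_extract (not the PR's implementation)."""
    if text is None:
        return None  # NULL input yields NULL
    compiled = re.compile(pattern)
    # Assumption: an index outside 0..=group_count is rejected up front.
    if idx < 0 or idx > compiled.groups:
        raise ValueError(f"group index {idx} out of range (0..{compiled.groups})")
    match = compiled.search(text)  # first match anywhere in the string
    if match is None:
        return ""  # no match: empty string, not NULL
    group = match.group(idx)
    # An optional group that did not participate also yields an empty string.
    return group if group is not None else ""


print(regexp_extract("100-200", r"(\d+)-(\d+)", 1))  # prints 100
```

The empty-string result for a non-match is the detail most likely to surprise users coming from functions that return NULL in that case, so it is worth covering explicitly in the SQL tests.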

What changes are included in this PR?

What changes are NOT included in this PR?

  • Support for LargeUtf8.
  • Support for Utf8View.

Are these changes tested?

  • Yes: unit tests, SQL logic tests, and CI.
# Unit tests
cargo test --package datafusion-functions --lib -- regex::regexpextract::tests --nocapture
# SQL tests
cargo test --test sqllogictests -- regexp_extract

Are there any user-facing changes?

Yes (a new regex function is added to the docs).

@calcaura calcaura marked this pull request as draft February 12, 2026 10:33
@github-actions github-actions bot added the functions Changes to functions implementation label Feb 12, 2026
@github-actions github-actions bot added documentation Improvements or additions to documentation sqllogictest SQL Logic Tests (.slt) labels Feb 12, 2026
@calcaura calcaura marked this pull request as ready for review February 12, 2026 14:45
@Jefffrey
Contributor

cc @Omega359 @comphead: did we ever reach a consensus on regexp_extract and regexp_substr? We had some PRs for them before that seemed to lapse, but it looks like there was still some discussion about which regex functions we include as part of DataFusion.

Development

Successfully merging this pull request may close these issues.

regexp_extract func from Spark
