@simonjayhawkins
Member

The PR using pyarrow native functions for _str_startswith/_str_endswith (#41222) was merged before the one for _str_contains (#41217), so we can now de-duplicate some logic.

This also includes a change to _str_match for a perf gain on the object fallback:

-      32.8±0.2ms       25.5±0.1ms     0.78  strings.Methods.time_match('arrow_string')
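
To illustrate the kind of de-duplication meant here, below is a minimal pure-Python sketch in which startswith, endswith and match all route through a single contains helper by anchoring the pattern. The function names and the list-based "array" are hypothetical stand-ins for illustration only; this is not the actual pandas or pyarrow code.

```python
import re
from typing import List, Optional, Sequence


def str_contains(
    values: Sequence[Optional[str]], pat: str, regex: bool = True
) -> List[Optional[bool]]:
    """Single shared code path: element-wise regex search, None stays missing."""
    if not regex:
        pat = re.escape(pat)
    compiled = re.compile(pat)
    return [None if v is None else compiled.search(v) is not None for v in values]


def str_startswith(values: Sequence[Optional[str]], pat: str) -> List[Optional[bool]]:
    # Escape the literal prefix and anchor it at the start of the string.
    return str_contains(values, r"\A" + re.escape(pat))


def str_endswith(values: Sequence[Optional[str]], pat: str) -> List[Optional[bool]]:
    # \Z (rather than $) so a trailing newline does not count as a match.
    return str_contains(values, re.escape(pat) + r"\Z")


def str_match(values: Sequence[Optional[str]], pat: str) -> List[Optional[bool]]:
    # Group the pattern before anchoring so alternations such as "a|b"
    # are anchored as a whole, mirroring re.match semantics.
    return str_contains(values, r"\A(?:" + pat + ")")


print(str_startswith(["apple", "pineapple", None], "app"))  # [True, False, None]
```

With this shape, only the contains helper needs a native fast path and an object fallback; the other three methods inherit both.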

@simonjayhawkins simonjayhawkins added Refactor Internal refactoring of code Strings String extension data type and string data labels May 15, 2021
@simonjayhawkins simonjayhawkins added this to the 1.3 milestone May 15, 2021
@simonjayhawkins
Member Author

AFAICT the coverage build uses pyarrow 4.0.0, so the extra lines of code for the fallback won't be hit there, but they are tested in other envs.

@jreback jreback merged commit 31ea5ed into pandas-dev:master May 17, 2021
@jreback
Contributor

jreback commented May 17, 2021

thanks!

@simonjayhawkins simonjayhawkins deleted the tidy branch May 17, 2021 15:53
TLouf pushed a commit to TLouf/pandas that referenced this pull request Jun 1, 2021
JulianWgs pushed a commit to JulianWgs/pandas that referenced this pull request Jul 3, 2021
