Expand diacritics #157

Merged: geoffrey-wu merged 7 commits into qbreader:main from Captain-Quack:patch-1 on May 14, 2023

@Captain-Quack
Contributor

I was torn on whether to include Greek characters, but decided to include them.
@skysomorphic left a comment
Member

It might be worth considering adding multiletter transliterations in the future, such as:

German "ß" as "ss" or "sz"
Icelandic "þ" as "th"
Nordic "æ" as "ae"
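
A minimal sketch of the suggestion above, assuming a plain lookup table applied character by character. The names `MULTILETTER` and `transliterate` are hypothetical and not taken from the qbreader codebase, and "ss" is picked for "ß" only for illustration (the comment leaves "ss" versus "sz" open).

```javascript
// Hypothetical table; the three entries come from the review comment above.
// "ss" is an arbitrary choice between the suggested "ss" and "sz".
const MULTILETTER = new Map([
  ['ß', 'ss'],
  ['þ', 'th'],
  ['æ', 'ae'],
]);

// Replace each mapped character with its multi-letter form and
// leave every other character unchanged.
function transliterate(text) {
  return Array.from(text, (ch) => (MULTILETTER.has(ch) ? MULTILETTER.get(ch) : ch)).join('');
}

console.log(transliterate('Straße')); // 'Strasse'
```

Only lowercase forms are covered here; uppercase "Þ" and "Æ" would need their own entries.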

@Captain-Quack
Contributor Author

something like ['((ae)|æ)'] could work
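
One way the alternation idea could look in practice is sketched below. It is an illustration under stated assumptions, not the code merged in this PR: `GROUPS`, `escapeRegex`, and `expandQuery` are hypothetical names, the query is lowercased first, and non-capturing groups `(?:ae|æ)` are used in place of the capturing `((ae)|æ)` form from the comment.

```javascript
// Hypothetical groups of interchangeable spellings.
const GROUPS = [
  ['ae', 'æ'],
  ['th', 'þ'],
  ['ss', 'ß'],
];

// Map every spelling to the alternation that matches its whole group.
const lookup = new Map();
for (const group of GROUPS) {
  for (const form of group) {
    lookup.set(form, `(?:${group.join('|')})`);
  }
}

// Single-pass matcher, so text inserted by a replacement is never rescanned.
const matcher = new RegExp([...lookup.keys()].join('|'), 'g');

// Escape regex metacharacters so the user's query is treated literally.
function escapeRegex(s) {
  return s.replace(/[.*+?^${}()|[\]\\]/g, '\\$&');
}

// Build a case-insensitive regex in which either spelling matches the other.
function expandQuery(query) {
  const escaped = escapeRegex(query.toLowerCase());
  const pattern = escaped.replace(matcher, (m) => lookup.get(m));
  return new RegExp(pattern, 'i');
}

console.log(expandQuery('caesar').test('Cæsar')); // true
```

Doing the substitution in one pass avoids one replacement rewriting the output of another, which a loop of separate `replace` calls could do.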

@geoffrey-wu
Member

I ran the backend tests. Searching for "cesaire" 5 times took 16.285s with the old method, 18.592s with this method before any optimizations, and 15.355s with this method after optimizations. The differences between these times are all well within the margin of error.
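
A comparison like this could be reproduced with a small timing harness. The sketch below is an assumption about how such a measurement might be set up in Node.js, not the actual backend test: `timeRuns` is a hypothetical helper, and the function being timed stands in for whatever search call the backend exposes.

```javascript
// Hypothetical helper: call `fn` `runs` times in sequence and report
// the total and mean wall-clock time in seconds.
async function timeRuns(fn, runs = 5) {
  const start = process.hrtime.bigint();
  for (let i = 0; i < runs; i++) {
    await fn(); // works for both sync and async functions
  }
  const totalSeconds = Number(process.hrtime.bigint() - start) / 1e9;
  return { runs, totalSeconds, meanSeconds: totalSeconds / runs };
}

// Usage with a placeholder workload; swap in the real search call.
timeRuns(() => 'cesaire'.normalize('NFD'), 5).then((result) => {
  console.log(`${result.runs} runs took ${result.totalSeconds.toFixed(3)}s`);
});
```

Repeating the whole measurement several times and comparing the spread is one way to judge whether a difference is within error.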

@geoffrey-wu merged commit ce033a0 into qbreader:main on May 14, 2023
@Captain-Quack deleted the patch-1 branch on May 14, 2023 at 17:01
geoffrey-wu added a commit that referenced this pull request on Jul 31, 2025
3 participants