Skip to content

Significantly improve event matching/filtering#2

Merged
ahobsonsayers merged 7 commits intomainfrom
ahs/improve-ticket-matching
Jul 2, 2025
Merged

Significantly improve event matching/filtering#2
ahobsonsayers merged 7 commits intomainfrom
ahs/improve-ticket-matching

Conversation

@ahobsonsayers
Copy link
Copy Markdown
Owner

@ahobsonsayers ahobsonsayers commented Jul 2, 2025

Significantly improve event filtering/matching by improving the normalisation of the event names and by implementing a home rolled string similarity algorithm for event names.

The string similarity algorithm works by using a modified Smith-Waterman local alignment algorithm to align a substring (desired event name) with a target string (the actual even name), and then a optimal string alignment Damerau-Levenshtein to calculate similarity on a word level within the substring. Word similarity is then averaged across all the words, after best alignment, to give the final similarity score.

This home rolled algorithm seems very robust and extremely suitable for our use case. This can be seen in the tests for the algorithm.

@ahobsonsayers ahobsonsayers merged commit f5f84bd into main Jul 2, 2025
2 checks passed
@ahobsonsayers ahobsonsayers deleted the ahs/improve-ticket-matching branch July 15, 2025 23:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant