Skip to content

Conversation

@chreman
Copy link
Member

@chreman chreman commented Aug 31, 2020

This PR fixes the root cause of two edge cases, which sometimes intersect: documents with single (or duplicate) documents that have no keywords. In theses cases, two bugs could occur individually or simultaneously: appearance of stopwords in bubble titles that should be excluded; wrongly increased number of bubbles.

The bubble title replacement was not working at the correct level: instead of replacing single bubble titles for one (or multiple) documents, multiple area titles were created according to the count of documents with missing subjects.

This issue has now been fixed, resulting in correct number of bubbles and preventing appearance of stopwords in bubble titles.

@chreman chreman merged commit 88ede04 into master Sep 7, 2020
@chreman chreman deleted the fix-titlestopwords branch September 7, 2020 11:20
chreman added a commit to chreman/Headstart that referenced this pull request Oct 13, 2021
…itlestopwords

fix stopwords in areatitles

Former-commit-id: 88ede04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants