add spacy.Language as valid argument for 'spacy_pipeline'#19
add spacy.Language as valid argument for 'spacy_pipeline'#19TimSchopf merged 2 commits intoTimSchopf:masterfrom dominik-schwabe:master
Conversation
|
Hi Dominik, thanks for the contribution. Can you also please add a short code example and explanation on how to use the new argument in the README.md file? Also, you can extract keyphrases from multiple documents with the same object and calling fit only once by using a list of documents as inputs. This probably solves the issue already. Best, |
|
I added a little example to the README.
Usually I make small experiments, where I inspect the results on one documents then change some things, try the changed document or try some different document. I also usually use |
This commit allows to reuse an object from
spacy.loadfor many different KeyphraseVectorizer objects. I noticed that the nlp objects gets loaded whenfitis called, which makes extracting keyphrases from multiple documents super slow when a model linken_core_web_mdis used.