From Paul:
multiple_essay_report.py is a script used to visualize document/token features of our spacy NLP pipeline at work. This script can be used to verify if pronouns like "they" are properly used in a document.
Currently, Paul has noted that coreferee (which is used for coreference resolution) fails to properly do this at the current version of spacy (specifically for pronoun antecedents). Our text examples include essays written for the GRE, which make use of pronouns in ways which spacy or coreferee are not trained to handle/properly identify. Now that we are updating to spacy 3.6+, we need to see if coreferee continues to do poorly; finally, we would like to see if other coreference modules perform better.
From Paul:
multiple_essay_report.pyis a script used to visualize document/token features of our spacy NLP pipeline at work. This script can be used to verify if pronouns like "they" are properly used in a document.Currently, Paul has noted that coreferee (which is used for coreference resolution) fails to properly do this at the current version of spacy (specifically for pronoun antecedents). Our text examples include essays written for the GRE, which make use of pronouns in ways which spacy or coreferee are not trained to handle/properly identify. Now that we are updating to spacy 3.6+, we need to see if coreferee continues to do poorly; finally, we would like to see if other coreference modules perform better.