NLP-Amazon Review Guessing

Train a classifiew to predict the review score of a basic amazon review.

System Requirements:

python 3.6 or heighr
sklearn library
pandas library
seaborn library
numpy library
matplotlib library

Installation:

installation can be done using conda.

conda activate
python setup.py install

Run:

from ReviewClassifier import classify
results = classify(train_data_file_name, test_data_file_name)

or

from ReviewClassifier import ReviewClassifier
results = ReviewClassifier(train_file_name).fitLogisticRegression(test_file_name)

The Task At Hand:

This task deals with text classification, one of the most common supervised tasks on text. A set of reviews in three diverse domains is attached to this assignment, and you goal is to predict a review’s score (rating) based on its text. The ratings span values 1-5, meaning that is a 5-way classification.

Each review has the following format:

{"overall": 4.0, "verified": true, "reviewTime": "09 7, 2015", "reviewerID":
"A2HDTDZZK1V4KD", "asin": "B0018CLO1I", "reviewerName": "Rachel E.Battaglia",
"reviewText": "This is a great litter box! The door rarely gets stuck, it's
easy to clean, and my cat likes it. I took one star off because the large is
really small. My cat is 6 pounds and this isn't quite big enough for her. I'm
ordering this same brand in a large. Online price is much cheaper than pets
stores or other stores!", "summary": "Great Box, Get XL size!",
"unixReviewTime": 1441584000}

where “overall” refers to the rating (the “label” you learn to predict), “review Text” to the body of the review, and “summary” to its summary.

Reviews are split into train (with 2000 reviews per class), and test (with 400 reviews per class) – you train a classifier on train data, evaluate and report the results on test data. We don’t have a validation set in this case since you are likely to work with classifiers’ default parameters, i.e., no tuning is required.

Test Examples:

as we can see this are the confusion matrix of each of the tests on the topics of Automotive, Pet stores and sports.

Name		Name	Last commit message	Last commit date
Latest commit History 23 Commits
Demo		Demo
Test		Test
src		src
.gitignore		.gitignore
Amazon Review Guessing.pdf		Amazon Review Guessing.pdf
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP-Amazon Review Guessing

System Requirements:

Installation:

Run:

The Task At Hand:

Test Examples:

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NLP-Amazon Review Guessing

System Requirements:

Installation:

Run:

The Task At Hand:

Test Examples:

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages