Twitter sentiment analysis

Data Collection

  • The tweets were collected using the streaming endpoint of the Twitter API, filtered by location (the coordinates of the US), and stored in MongoDB.
  • For NER, the 4-class Stanford Named Entity Recognizer is used through NLTK's Stanford interface.
  • Based on frequency, the top named entities are identified and relevant news articles are retrieved using News API.
  • For sentiment analysis, TextBlob is used. An individual sentiment value is calculated for each tweet and news item related to a named entity, and the mean of these values gives the entity's final sentiment.

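The averaging step above can be sketched as follows. This is a minimal illustration, not code from the repository: the scores are made-up floats, whereas in the real pipeline each one would come from TextBlob's sentiment.polarity (which lies in [-1.0, 1.0]).

```python
from statistics import mean

def combined_sentiment(tweet_scores, news_scores):
    """Mean of per-text polarity scores for one named entity.

    In the actual pipeline each score would be produced by
    TextBlob(text).sentiment.polarity; plain floats stand in here.
    """
    all_scores = list(tweet_scores) + list(news_scores)
    if not all_scores:
        return 0.0  # no data: treat as neutral
    return mean(all_scores)

# Illustrative scores for a single entity
tweets = [0.5, 0.25, -0.25]
news = [0.25, 0.25]
print(combined_sentiment(tweets, news))  # 0.2
```

Averaging tweet and news scores together in one pool weights each text equally; an alternative would be to average the two sources separately and then combine them.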
Files

  • GraphInt.py: Contains code for the online app.
  • config.py: Parameters used in the app: app keys, database name, etc.
  • db_op.py: Used for fetching values (created_at, location) from MongoDB. The functions take an id as their argument.
  • get_ner.py: Used to extract named entities. The function getNER returns a list of top named entities and a dictionary of NE and cleaned tweets.
  • get_news.py: Takes list of named entities as input and returns a dict of named entities and fetched headlines.
  • get_sentiment.py: Takes the headline and tweet dicts as input and returns the average sentiment score for each entity. A score > 0 is positive, a score < 0 is negative, and 0 signifies neutral.
  • get_tweets.py: Used to fetch tweets from the Twitter API and store them in MongoDB.
  • main.py: Used for generating the graphs below.
  • mongoload.py: Used for dumping data to database on droplet.
  • plot_data.py: Contains 3 functions that plot graphs for sentiment, time-series data of tweets, and location.
  • twitter.json: mongoDB dump

  • The images might take some time to update, even after the automatic refresh.
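The scoring convention described for get_sentiment.py can be illustrated with a small helper. Note that `label` is a hypothetical name for this sketch, not a function from the repository:

```python
def label(score: float) -> str:
    """Map an average polarity score to a sentiment label.

    Follows the convention above: > 0 is positive,
    < 0 is negative, and exactly 0 is neutral.
    """
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"

print(label(0.35))   # positive
print(label(-0.1))   # negative
print(label(0.0))    # neutral
```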

TODO:

Pull requests are welcome.

  1. Collect a random sample of 10K tweets using the Twitter API and store them in a MongoDB instance.
  2. From these collected tweets, parse the 5 most frequently occurring named entities (these can be names of people, locations, products, etc.).
  3. Collect the latest news featuring the named entities from Step 2 from various news-source APIs (use at least one API/library other than Twitter's to collect this data).
  4. Perform a sentiment analysis on the data collected in Steps 1 and 3, and compare the Twitter and news sentiments for the common named entities.
  5. Also perform temporal, spatial, and content analysis on the collected data, to answer questions such as who posted the data, what it was about, when it was posted, and from where it was posted.
  6. Report the results you found in Steps 4 and 5 using graphs. Brownie points for cool interactive visualisations.
  7. Set up a web application on Heroku or a DigitalOcean Droplet with a user interface where one can input a named entity and get a comparison between the news and Twitter sentiments as output.
  8. Put all your code, along with the MongoDB collection, in a GitHub repository and share the link with us. Also maintain a README.md explaining your codebase and the approach you followed.
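Step 2 (the five most frequent named entities) can be sketched with collections.Counter. The tag list here is illustrative; in the real pipeline the entities would come from the Stanford NER tagger:

```python
from collections import Counter

def top_entities(entities, k=5):
    """Return the k most frequently occurring named entities."""
    return [name for name, _count in Counter(entities).most_common(k)]

# Illustrative tags as the NER step might emit them
tags = ["NASA", "Texas", "NASA", "Tesla", "Texas", "NASA",
        "Boeing", "Tesla", "SpaceX", "Apple", "NASA"]
print(top_entities(tags))  # ['NASA', 'Texas', 'Tesla', 'Boeing', 'SpaceX']
```

Counter.most_common breaks ties by first-insertion order, which is why "Boeing" and "SpaceX" beat "Apple" here despite equal counts.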
