TextMining Project

The project is written with Python 3.7.9. Several libraries are used and the whole list is present in the file requirements.txt.

Also, in order to avoid version dependences problems, we've used a common anaconda environment that is specified in the file environment.yml.

Before execute

There are two ways to install the correct libraries:

with anaconda environment specific
```
 conda env create -f environment.yml
```
with pip library installation:
```
 pip install requirements.txt 
```

Folder structure

C:.
│   .gitignore
│   01-Text_Exploration.ipynb
│   02-Text_Processing_Representation.ipynb
│   03-Features_extraction.ipynb
│   04-Text_Classification.ipynb
│   05-Text_Classification_Binary.ipynb
│   environment.yml
│   README.md
│   requirements.txt
│   WordCloud.ipynb
│
├───data
│   │   featured_data.csv
│   │   labeled_data.csv
│   │   mask_marco.png
│   │   processed_data.csv
│   │   trump_tweets.csv
│   │
│   └───representations
│           bag_of_words.npz
│           count_vector.npz
│           doc2vec.npy
│           tf-idf.npy
│
├───models
│   ├───base
│   │       README.md
│   │
│   ├───binary
│   │       README.md
│   │
│   └───doc2vec
│           README.md
│
└───pics
    │   sentiment.PNG
    │
    └───wordcloud
            hate.PNG
            hate_offensive.PNG
            neither.PNG
            not_hate.PNG
            offensive.PNG

Execute

The execution follows the order of notebooks:

   01-Text_Exploration.ipynb
   02-Text_Processing_Representation.ipynb
   03-Features_extraction.ipynb
   04-Text_Classification.ipynb
   05-Text_Classification_Binary.ipynb

Each notebook generate some output files in the execution that is used in the followings notebooks.

This division is done for code cleanup reasons.

NB In Text_Classification notebooks neural network can be changed by changing the function used.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

TextMining Project

Before execute

Folder structure

Execute

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 52 Commits
data		data
models		models
pics		pics
.gitignore		.gitignore
01-Text_Exploration.ipynb		01-Text_Exploration.ipynb
02-Text_Processing_Representation.ipynb		02-Text_Processing_Representation.ipynb
03-Features_extraction.ipynb		03-Features_extraction.ipynb
04-Text_Classification.ipynb		04-Text_Classification.ipynb
05-Text_Classification_Binary.ipynb		05-Text_Classification_Binary.ipynb
PresentationTextMining.pptx		PresentationTextMining.pptx
README.md		README.md
WordCloud.ipynb		WordCloud.ipynb
environment.yml		environment.yml
requirements.txt		requirements.txt

MarcoP9/textmining

Folders and files

Latest commit

History

Repository files navigation

TextMining Project

Before execute

Folder structure

Execute

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages