
A spam detection model that identifies unsolicited and unwanted emails. The project explores classifying emails as spam or not spam by converting the text data (emails) into vectors and training a deep learning model on them.


Ellaluo/SpamDetectionInEmails

 
 


SpamDetectionInEmails

A spam detection model built by converting text data into vectors and training a deep learning model on them.

An article explaining the project in depth is published at:

https://lionbridge.ai/articles/using-natural-language-processing-for-spam-detection-in-emails/

The dataset is the open-source Spambase dataset from the UCI Machine Learning Repository. It contains 5569 emails, of which 745 are spam.

Text preprocessing involved removing punctuation, stop words, extra whitespace, and URLs, and converting the text to lowercase. The cleaned texts were then tokenized, and the token sequences were padded to a uniform length. A deep learning model was trained on these padded vectors.
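The cleaning, tokenization, and padding steps could be sketched roughly as follows. This is a minimal standard-library illustration, not the notebook's actual code: the tiny stop-word list, the helper names (`clean_text`, `build_vocab`, `encode`, `pad`), the sample emails, and the choice to right-pad with zeros are all assumptions made for the example. The notebook itself may rely on a library tokenizer instead.

```python
import re
import string

# Hypothetical, deliberately tiny stop-word list; a real project would use a fuller one.
STOP_WORDS = {"a", "an", "the", "is", "to", "and", "of", "you", "your"}

URL_PATTERN = re.compile(r"(https?://\S+|www\.\S+)")
PUNCT_TABLE = str.maketrans("", "", string.punctuation)


def clean_text(text):
    """Lowercase, then drop URLs, punctuation, stop words, and extra whitespace."""
    text = text.lower()
    text = URL_PATTERN.sub(" ", text)
    text = text.translate(PUNCT_TABLE)
    words = [w for w in text.split() if w not in STOP_WORDS]
    return " ".join(words)


def build_vocab(cleaned_texts):
    """Map each word to an integer id starting at 1; 0 is reserved for padding."""
    vocab = {}
    for text in cleaned_texts:
        for word in text.split():
            if word not in vocab:
                vocab[word] = len(vocab) + 1
    return vocab


def encode(text, vocab):
    """Turn cleaned text into a list of ids, skipping unknown words."""
    return [vocab[w] for w in text.split() if w in vocab]


def pad(sequence, max_len):
    """Truncate or right-pad with zeros so every sequence has max_len ids."""
    return sequence[:max_len] + [0] * max(0, max_len - len(sequence))


# Made-up sample emails, for illustration only.
emails = [
    "WIN a FREE prize!!! Visit http://example.com now",
    "Are we still meeting at noon?",
]
cleaned = [clean_text(e) for e in emails]
vocab = build_vocab(cleaned)
padded = [pad(encode(c, vocab), max_len=6) for c in cleaned]
print(cleaned)
print(padded)
```

Every padded sequence has the same length, which is what lets a batch of emails be fed to a neural network as a single fixed-shape array.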

Results:

Because spam makes up only a small share of the data, the model's performance is measured by F1-score.

  • Precision: 99.21
  • Recall: 91.24
  • F1-score: 95.06
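As a quick consistency check on these figures, F1 is the harmonic mean of precision and recall. The sketch below recomputes it from the reported precision and recall, and uses the dataset counts stated in this README to show why plain accuracy would be a weak metric here. The function names are illustrative and are not taken from the notebook.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall (same units in, same units out)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def majority_baseline_accuracy(total, positives):
    """Accuracy of a classifier that labels every email as not spam."""
    return (total - positives) / total


# Reported precision and recall, in percent.
reported_f1 = f1_score(99.21, 91.24)
print(round(reported_f1, 2))  # 95.06

# Dataset counts from this README: 5569 emails, 745 spam.
baseline = majority_baseline_accuracy(5569, 745)
print(round(baseline * 100, 2))  # 86.62
```

A classifier that never flags spam would already reach about 86.6% accuracy on these counts while catching no spam at all, which is why F1 is the more informative metric for this task.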


Languages

  • Jupyter Notebook 100.0%