Arabic Dialect Identification

This NLP project predicts the Arabic dialect used in a tweet.

Dataset

The dataset used in this project is a collection of tweets, each labeled with its dialect. The dialects come from five countries: Egypt ('EG'), Lebanon ('LB'), Libya ('LY'), Sudan ('SD'), and Morocco ('MA').
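To make the label scheme concrete, here is a minimal sketch of how the five country codes could be mapped to integer class ids for training and back to readable names at prediction time. The names `DIALECT_LABELS`, `LABEL_TO_ID`, `ID_TO_LABEL`, and `describe` are hypothetical, and the alphabetical id ordering is an assumption; the project's own encoding may differ.

```python
# Country codes and names as listed in the dataset description.
DIALECT_LABELS = {
    "EG": "Egypt",
    "LB": "Lebanon",
    "LY": "Libya",
    "SD": "Sudan",
    "MA": "Morocco",
}

# Assumption: class ids follow alphabetical order of the codes.
LABEL_TO_ID = {code: i for i, code in enumerate(sorted(DIALECT_LABELS))}
ID_TO_LABEL = {i: code for code, i in LABEL_TO_ID.items()}


def describe(class_id: int) -> str:
    """Turn a predicted class id into a readable 'CODE (Country)' string."""
    code = ID_TO_LABEL[class_id]
    return f"{code} ({DIALECT_LABELS[code]})"


if __name__ == "__main__":
    for class_id in sorted(ID_TO_LABEL):
        print(class_id, describe(class_id))
```

Keeping both directions of the mapping in one place avoids mismatches between the ids used during training and the codes shown to users.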

Experiments

I tried a variety of models in this project. Here is a summary of the results:

Model	F1 score	Validation accuracy
TF-IDF + Multinomial Naive Bayes	0.69	0.72
LSTM	0.82	0.82
GRU	0.79	0.79
Hybrid model (character and token embeddings)	0.80	0.80
AraBERT	0.84	0.84
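For readers unfamiliar with the two metrics in the table, the sketch below computes accuracy and an F1 score on a tiny made-up set of predictions using only the standard library. It assumes macro-averaged F1 (the unweighted mean of per-class F1); the README does not say which averaging the reported scores use, and the sample labels are invented purely for illustration.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumed averaging, see lead-in)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Made-up labels for illustration only, not project data.
y_true = ["EG", "EG", "LB", "LY", "SD", "MA"]
y_pred = ["EG", "LB", "LB", "LY", "SD", "EG"]

if __name__ == "__main__":
    print(f"accuracy: {accuracy(y_true, y_pred):.2f}")  # 0.67
    print(f"macro F1: {macro_f1(y_true, y_pred):.2f}")  # 0.63
```

Accuracy and F1 can diverge when classes are imbalanced or when errors concentrate in a few classes, which may explain the gap between the two columns for the Naive Bayes baseline.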

The API and sample output

The API was created using FastAPI, and the app was then tested.

How to Run

Clone the Repository

git clone https://github.com/ronysalem/Arabic-Dialects-Identification

Navigate to the Project Directory

cd Arabic-Dialects-Identification

Build and Run the Docker Container

docker-compose up --build

Accessing the Application

Once the application is running, you can access it by opening a web browser and visiting:

http://localhost:8000
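If you prefer a scripted check over opening a browser, the following minimal sketch uses Python's standard library to confirm that something answers at the address above. The function name `app_is_up` and the timeout value are illustrative and not part of the repository; the check only requests the root URL and makes no assumption about the API's routes.

```python
import http.client
import urllib.request

APP_URL = "http://localhost:8000"  # address given in this README


def app_is_up(url: str = APP_URL, timeout: float = 3.0) -> bool:
    """Return True if the URL answers with a 2xx status, False otherwise."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return 200 <= response.status < 300
    except (OSError, http.client.HTTPException):
        # Covers refused connections, timeouts, and HTTP error statuses.
        return False


if __name__ == "__main__":
    print("app reachable:", app_is_up())
```

A `False` result right after `docker-compose up --build` may simply mean the container is still starting, so retrying after a few seconds is reasonable.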
