This is a test challenge for the DrwinTech interview process.
The application extracts skills from an uploaded CV.
Supported file formats: pdf, doc, docx
Temporarily deployed at: https://drwin-tech-cvparser.herokuapp.com
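As a rough illustration of the idea, the following is a minimal sketch of checking the supported formats and matching keywords in CV text. It is not the project's actual code: `is_supported`, `extract_skills`, `KNOWN_SKILLS`, and `STOPWORDS` are hypothetical names, the skill and stopword lists are tiny stand-ins, and the text is assumed to have already been extracted from the uploaded file.

```python
import re
from pathlib import Path

# Extensions taken from the "Supported file formats" line above.
SUPPORTED_EXTENSIONS = {".pdf", ".doc", ".docx"}

# Hypothetical stand-ins: a real skills list would be much larger, and the
# stopwords would normally come from the NLTK stopwords corpus.
KNOWN_SKILLS = {"python", "django", "sql", "machine learning"}
STOPWORDS = {"a", "an", "and", "in", "of", "the", "with"}


def is_supported(filename):
    """Return True if the file extension is one of the supported formats."""
    return Path(filename).suffix.lower() in SUPPORTED_EXTENSIONS


def extract_skills(text, known_skills=KNOWN_SKILLS):
    """Return the known skills found in plain CV text, sorted alphabetically."""
    # Lowercase, split into simple word tokens, and drop stopwords.
    tokens = [t for t in re.findall(r"[a-z0-9+#]+", text.lower())
              if t not in STOPWORDS]
    # Candidates are single tokens plus pairs of adjacent tokens, so that
    # two-word skills such as "machine learning" can also match.
    candidates = set(tokens)
    candidates.update(" ".join(pair) for pair in zip(tokens, tokens[1:]))
    return sorted(candidates & set(known_skills))
```

For example, `extract_skills("Experienced in Python, Django and machine learning.")` returns `['django', 'machine learning', 'python']`, and `is_supported("cv.txt")` returns `False`.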
These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.
- Python 3.8.1
- get project files from github
    git clone https://github.com/eugeny-m/CVParser.git
    cd CVParser
- create python environment
    python3.8 -m venv venv
- activate venv
    source venv/bin/activate
- install packages
    pip install -r requirements.txt
- download nltk packages
    python -m nltk.downloader words
    python -m nltk.downloader stopwords
- create tables
    python manage.py migrate
- collect project static
    python manage.py collectstatic
- run server on 127.0.0.1:8000
    DJANGO_DEBUG=1 python manage.py runserver
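
Once the development server is running, a quick check that it answers requests can be scripted. This is a minimal sketch using only the standard library; `is_server_up` is a hypothetical helper that is not part of the project, and the default URL assumes the server is listening on 127.0.0.1:8000 as in the last step.

```python
import urllib.error
import urllib.request


def is_server_up(url="http://127.0.0.1:8000/", timeout=3.0):
    """Return True if the URL answers with any HTTP response."""
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server responded, just not with a success status.
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or name resolution failure.
        return False
```

Calling `is_server_up()` from a Python shell in a second terminal should return `True` while `runserver` is active and `False` after it is stopped.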
The application includes configuration for deploying to Heroku.
Create an account and an app at https://dashboard.heroku.com/apps,
then deploy with Heroku Git: https://devcenter.heroku.com/articles/git
- Python 3.8.1
- Django 3.0.2 - The web framework used
- spaCy 2.2.3
- NLTK 3.4.5
- Eugeny Maksimov - eugeny-m
- Many thanks to the PyResparser project, which was used as the base for the skills extractor.