pip install virtualenv
virtualenv venv
venv/Scripts/activate
pip install -r requirements.txt
python -m spacy download en_core_web_sm
The directory data/cv_dataset
contains all the pdfs of candidates CVs.
This will extract all the required detials for you and will store in data/preprocessed_data/cv_details
.
python -u src/pdf_extraction.py
python -u main.py