Mini-project for the Codoc coding interview.
- Install missing dependencies if needed
- Change directory to the codoc-defi-code folder
- Run the following from the command line:
    python main.py
- main.py : main file, containing the parsing and processing functions
- db.py : functions related to the database (drwh.db)
- readFile.py : functions used to convert PDF/DOCX files to ASCII text
- regex.py : functions used to retrieve DATE and AUTHOR with regular expressions
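
As an illustration of the kind of extraction regex.py performs, here is a minimal sketch, not the project's actual code. The function names (extract_date, extract_author), the DD/MM/YYYY date format, and the "Author:" label are all assumptions made for this example; the real documents and patterns may differ.

```python
import re

# Hypothetical patterns: the real regex.py may target other formats.
# Assumes dates are written as DD/MM/YYYY.
DATE_PATTERN = re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b")
# Assumes the author appears on its own line as "Author: <name>".
AUTHOR_PATTERN = re.compile(r"^Author\s*:\s*(.+)$", re.IGNORECASE | re.MULTILINE)


def extract_date(text):
    """Return the first date-like string found in text, or None."""
    match = DATE_PATTERN.search(text)
    return match.group(0) if match else None


def extract_author(text):
    """Return the name following an 'Author:' label, or None."""
    match = AUTHOR_PATTERN.search(text)
    return match.group(1).strip() if match else None


if __name__ == "__main__":
    sample = "Report\nAuthor: Jane Doe\nDate: 03/02/2021\nBody text."
    print(extract_date(sample))
    print(extract_author(sample))
```

Both helpers return None when nothing matches, so the caller can decide how to handle a document with a missing date or author.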