Extract text from PDF or PowerPoint documents
-
Clone the repository (or your fork of it):
git clone https://github.com/CaptainVee/vidare.git
-
Change into the vidare directory:
cd vidare
-
Create and activate a virtual environment:
python3 -m venv venv && source venv/bin/activate
-
Install the dependencies:
pip install -r requirements.txt
-
Apply the migrations and run the Django development server:
python manage.py makemigrations
python manage.py migrate
python manage.py runserver
-
Test at http://localhost:8000/
Upload a document using the form in the top-right corner and submit it to get the extracted text.
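
If the page does not load, it can help to confirm that the development server is actually answering before debugging the upload form. The following is a minimal sketch using only the Python standard library; the function name `server_is_up` and the 3-second timeout are illustrative assumptions and are not part of the vidare code base.

```python
import urllib.error
import urllib.request


def server_is_up(url: str = "http://localhost:8000/", timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at `url`, False otherwise.

    Hypothetical helper for illustration; not part of the vidare project.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            return True
    except urllib.error.HTTPError:
        # The server answered, just with an error status (e.g. 404 or 500).
        return True
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, or DNS failure: nothing usable is listening.
        return False


if __name__ == "__main__":
    print("server reachable" if server_is_up() else "server not reachable")
```

Run it from a second terminal while `python manage.py runserver` is running. An error status still counts as "reachable" here, since it shows that Django is up and the problem lies in the application rather than in the setup steps.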