CaptainVee / vidare

Extract text from PDF or PowerPoint documents

Vidare

Extract text from PDF or PowerPoint documents

Want to use this project?

  1. Fork or clone the repository:

     git clone https://github.com/CaptainVee/vidare.git
  2. Change into the project directory:

     cd vidare
  3. Create and activate a virtual environment:

     python3 -m venv venv && source venv/bin/activate
  4. Install the dependencies:

     pip install -r requirements.txt
  5. Apply the migrations and run the Django development server:

     python manage.py makemigrations
     python manage.py migrate
     python manage.py runserver
  6. Open http://localhost:8000/ in a browser to test.

Upload a document using the form in the top-right corner, then submit it to get the extracted text.
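If the page does not load, it can help to confirm that the development server is actually listening before looking for other causes. The snippet below is a minimal sketch using only the Python standard library; the helper name `server_is_up` is hypothetical and not part of this project, and the default URL simply assumes the server is running on Django's default port as in the steps above.

```python
import http.client
import urllib.error
import urllib.request


def server_is_up(url: str = "http://localhost:8000/", timeout: float = 3.0) -> bool:
    """Return True if an HTTP server answers at `url`, False if nothing is reachable.

    Hypothetical helper for illustration; it is not part of the vidare code base.
    """
    try:
        with urllib.request.urlopen(url, timeout=timeout):
            # Any successful response means the server is listening.
            return True
    except urllib.error.HTTPError:
        # The server answered, but with an error status (4xx/5xx).
        return True
    except (urllib.error.URLError, http.client.HTTPException, OSError):
        # Connection refused, timeout, DNS failure, or a malformed reply.
        return False


if __name__ == "__main__":
    print("server reachable" if server_is_up() else "server not reachable")
```

Run it from a second terminal while `python manage.py runserver` is running in the first. A "server not reachable" result usually means the server did not start or is listening on a different port.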
