marcelovicentegc / toothbrush

A web-app to extract text patterns from texts 🦷

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Build Status

Toothbrush

A web-app that extracts the 20 most common words from .pdf, .doc and .txt files.

Demo

Directions

  1. Clone this repository:
  2. Install dependencies: cd Toothbrush/project/requirements; pip install -r base_win.txt; python -c "import nltk; nltk.download('stopwords'); nltk.download('punkt')" if on Windows, or cd Toothbrush/project/requirements && pip install -r base.txt && python -c "import nltk; nltk.download('stopwords'); nltk.download('punkt')" if on Mac/Linux
  3. Set a environment variable for DJANGO_TOOTHBRUSH_SECRET_KEY
  4. Run migrations: cd ../..; python manage.py makemigrations; python manage.py makemigrations toothpaste if on Windows, or cd ../.. && python manage.py makemigrations && python manage.py makemigrations toothpaste if on Mac/Linux
  5. Migrate database: python manage.py migrate
  6. Run the application: python manage.py runserver

About

A web-app to extract text patterns from texts 🦷

License:MIT License


Languages

Language:Python 56.7%Language:HTML 29.3%Language:CSS 10.5%Language:JavaScript 3.0%Language:Dockerfile 0.5%