lorarjohns / lylas

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

LYLAS Lab on Pay Parity Hackathon

Setup

Use the package manager pip to install dependencies.

pip install -r requirements.txt

set environment variables for your API key, username, and password to use the Socrata API. You'll need an NYC Open Data API key.

One way to set environment variables is with an .envrc file in your project directory. You can activate it with direnv.

Features

  • Topic modeling using Latent Dirichlet Allocation, SVD, t-SNE
  • Visualizations with pyLDAvis
  • Fuzzy matching with Cythonized implementation of sparse top-n similarity
  • NLP with gensim, sklearn, and spaCy

** Highly unfinished! ** TODO:

  • implement search and retrieval interface
  • expand dataset
  • tests
  • really, everything!

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

Please make sure to update tests as appropriate.

About


Languages

Language:HTML 70.7%Language:Python 26.2%Language:Jupyter Notebook 3.1%