Forest Gregg's starred repositories
python-goose
Html Content / Article Extractor, web scrapping lib in Python
textdistance
📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
computer-vision-basics-in-microsoft-excel
Computer Vision Basics in Microsoft Excel (using just formulas)
leaflet-realtime
Put realtime data on a Leaflet map
pytest-flask-sqlalchemy
A pytest plugin for preserving test isolation in Flask-SQLAlchemy using database transactions.
weighted-levenshtein
Weighted Levenshtein library
things-cloud-sdk
golang client for the culturedcode things cloud
python-wheels-manylinux-build
GitHub Action to build Python manylinux wheels
Levenshtein_search
Python search module for fast approximate string matching
article-tagging
Natural Language Processing of Chicago news articles
learned-string-alignments
Learning String Alignments for Entity Aliases
json-to-multicsv
Split a JSON file with hierarchical data to multiple CSV files
cafr-parsing
Automated data extraction from U.S. state Comprehensive Annual Financial Reports (CAFR).
graphical-record-linkage
A Python encapsulation of Steorts, et. al. (2015) graphical record linkage system
queer-civic-data
Materials for "Queer Communities, Civic Tech, and Open Data" workshop at MozFest 2018
stream-sample
sample streams using reservoir sampling
news-data-extraction
A repository of scripts for extracting news articles from US newspapers
chicago-tree
Chicago tree related data
Haystack-SolrEnginePlus
Extending queryset and SolrBackend models for Django Haystack, that lets Django Haystack support Solr's Cursor Pagination, eDisMax(in progressing)
lara-scraper
Scraper for the State of Michigan's Department of Licensing and Regulatory Affairs' business entity database