Forest Gregg (fgregg)

Company:@datamade, Partner

Location:Great Lakes

Home Page:https://bunkum.us

Twitter:@forestgregg

Organizations
datamade
dssg
open-city

Forest Gregg's starred repositories

python-goose

HTML Content / Article Extractor, web scraping lib in Python

Language:HTML, License:Apache-2.0, Stargazers:3963, Issues:202, Issues:173

textdistance

📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

Language:Python, License:MIT, Stargazers:3337, Issues:64, Issues:0

computer-vision-basics-in-microsoft-excel

Computer Vision Basics in Microsoft Excel (using just formulas)

Mallet

MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.

Language:Java, License:NOASSERTION, Stargazers:974, Issues:85, Issues:130

pglogical

Logical Replication extension for PostgreSQL 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.

Language:C, License:NOASSERTION, Stargazers:966, Issues:83, Issues:413

leaflet-realtime

Put realtime data on a Leaflet map

Language:JavaScript, License:ISC, Stargazers:734, Issues:46, Issues:138

Pweave

Pweave is a scientific report generator and a literate programming tool for Python. It can capture the results and plots from data analysis and works well with numpy, scipy and matplotlib.

Language:Python, License:NOASSERTION, Stargazers:435, Issues:19, Issues:120

pytest-flask-sqlalchemy

A pytest plugin for preserving test isolation in Flask-SQLAlchemy using database transactions.

Language:Python, License:MIT, Stargazers:254, Issues:8, Issues:34

flock

flock(1) locks files

weighted-levenshtein

Weighted Levenshtein library

Language:Python, License:MIT, Stargazers:105, Issues:42, Issues:23

things-cloud-sdk

Go client for the Cultured Code Things Cloud

Language:Go, License:MIT, Stargazers:99, Issues:7, Issues:4

python-wheels-manylinux-build

GitHub Action to build Python manylinux wheels

Language:Shell, License:Apache-2.0, Stargazers:92, Issues:5, Issues:30

Levenshtein_search

Python search module for fast approximate string matching

Language:C, License:GPL-3.0, Stargazers:52, Issues:7, Issues:15

article-tagging

Natural Language Processing of Chicago news articles

Language:Jupyter Notebook, License:MIT, Stargazers:49, Issues:13, Issues:49

learned-string-alignments

Learning String Alignments for Entity Aliases

Language:Python, License:Apache-2.0, Stargazers:38, Issues:13, Issues:2

json-to-multicsv

Split a JSON file with hierarchical data to multiple CSV files

Language:Perl, License:MIT, Stargazers:28, Issues:3, Issues:5

pysettrie

python3 package supporting efficient storage and querying of sets of sets using the trie data structure. Supports finding all the supersets/subsets of a given set from a collection of sets. Also includes a trie-based mapping container where the keys are sets.

Language:Python, License:LGPL-3.0, Stargazers:24, Issues:0, Issues:8
Language:OpenEdge ABL, Stargazers:16, Issues:0, Issues:0

cafr-parsing

Automated data extraction from U.S. state Comprehensive Annual Financial Reports (CAFR).

Language:HTML, License:AGPL-3.0, Stargazers:16, Issues:9, Issues:2

oasis

A Python package for efficient evaluation based on OASIS (Optimal Asymptotic Sequential Importance Sampling).

Language:Python, License:MIT, Stargazers:14, Issues:3, Issues:0

graphical-record-linkage

A Python encapsulation of the Steorts et al. (2015) graphical record linkage system

Language:HTML, License:CC0-1.0, Stargazers:10, Issues:1, Issues:0

queer-civic-data

Materials for "Queer Communities, Civic Tech, and Open Data" workshop at MozFest 2018

stream-sample

sample streams using reservoir sampling

Language:JavaScript, Stargazers:5, Issues:0, Issues:0

yoshiko

(Weighted) Cluster Editing

Language:C++, License:MIT, Stargazers:4, Issues:0, Issues:0

news-data-extraction

A repository of scripts for extracting news articles from US newspapers

Language:Python, Stargazers:2, Issues:1, Issues:0

chicago-tree

Chicago tree related data

Haystack-SolrEnginePlus

Extends the queryset and SolrBackend models for Django Haystack so that it supports Solr's cursor pagination and eDisMax (in progress)

Language:Python, License:MIT, Stargazers:2, Issues:2, Issues:0

lara-scraper

Scraper for the State of Michigan's Department of Licensing and Regulatory Affairs' business entity database

Language:Python, Stargazers:2, Issues:0, Issues:0

1909

OCR of Chicago 1909 Renumbering Plan

Language:Python, Stargazers:2, Issues:3, Issues:0
Language:OpenEdge ABL, Stargazers:1, Issues:0, Issues:0