Matt Nicklay's starred repositories
OpenRefine
OpenRefine is a free, open source power tool for working with messy data and improving it.
more-itertools
More routines for operating on iterables, beyond itertools
StarCluster
StarCluster is an open source cluster-computing toolkit for Amazon's Elastic Compute Cloud (EC2).
google-ngrams
Python scripts for retrieving CSV data from the Google Ngram Viewer and plotting it in XKCD style. The script for retrieving ngram data was adapted from the script at www.culturomics.org.
BookwormDB
Tools for text tokenization and encoding