Kris Shaffer's starred repositories
PythonDataScienceHandbook
Python Data Science Handbook: full text in Jupyter Notebooks
wayback-machine-downloader
Download an entire website from the Wayback Machine.
arxiv-sanity-preserver
Web interface for browsing, search and filtering recent arxiv submissions
congress-legislators
Members of the United States Congress, 1789-Present, in YAML/JSON/CSV, as well as committees, presidents, and vice presidents.
CourseraML
I took Andrew Ng's Machine Learning course on Coursera and did the homework assigments... but, on my own in python because I love jupyter notebooks!
read-this-first
Start here!
botometer-python
A Python API for Botometer by OSoMe
wordVectors
An R package for creating and exploring word2vec and other word embedding models
facebook-political-ads
Monitoring Facebook Political Ads
nyc-stabilization-unit-counts
Scrape unit counts for NYC rent stabilized apts from tax bills
theguardian-api-python
Python client for thegaurdian api
2017-08-partisan-sites-and-facebook-pages
Data, analytic code, and findings related to the BuzzFeed News article, "Inside The Partisan Fight For Your News Feed," published August 8, 2017.
RDRPOSTagger
R package for Ripple Down Rules-based Part-Of-Speech Tagging (RDRPOS). On more than 45 languages.
doppelganger-finder
Doppelganger-finder finds multiple accounts (doppelgangers) of a user.
collect-social
Simply collect social media content
discursive
Twitter topic search and indexing with Elasticsearch
frontpages
Analysis of the front page of newspapers
open-data-sets
Open data sets scraped, cleaned, and/or FOIA'd by Chicago, for Chicago
twitter_tools
Scraping, network analysis, NLP, etc.