Jeremy Singer-Vine's repositories
pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
waybackpack
Download the entire Wayback Machine archive for a given URL.
notebookjs
Render Jupyter/IPython notebooks on the fly, in the browser. (Or on the command line, if you'd like.)
weightedcalcs
Pandas-based utility to calculate weighted means, medians, distributions, standard deviations, and more.
intro-to-visidata
Source files for "An Introduction to VisiData"
visidata-plugins
A place for me to share VisiData plugins I've written.
visidata-cheat-sheet
A one-page cheat sheet for VisiData, available in multiple languages.
buzzfeed-news-trending-strip
Dataset: BuzzFeed News “Trending” Strip, 2018–2023
tab-bankrupter
A Chrome extension for declaring "tab bankruptcy" without losing all your links.
fbpagefeed
A library and command-line tool for fetching Facebook Pages' published posts.
nicar-2018-schedule
Your unofficial guide to what's happening next at NICAR 2018.
nicar-2019-schedule
The NICAR 2019 conference schedule as JSON and CSV files, plus the underlying Python scraper.
pdfminer.six
Community maintained fork of pdfminer
warn-scraper
Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites
aphis-inspection-reports-flags
Citations data from USDA's Animal and Plant Health Inspection Service, flagged for various phenomenons of public interest.
mortgage-application-analysis-for-futuro-investigates
Code and data supporting Futuro Investigates’ examination of mortgage application outcomes in New Jersey
SDI-Health
Dissemination of harmonization code and data for SDI Health surveys