Ethan Yao's repositories
codechella
Data, Code and other material for CodeChella concert
congress-legislators
Members of the United States Congress, 1789-Present, in YAML/JSON/CSV, as well as committees, presidents, and vice presidents.
d6tstack
Quickly ingest messy CSV and XLS files. Export to clean pandas, SQL, parquet
data-police-shootings
The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty since 2015.
funnyfed
Visualizing the amount of laughter at FOMC meetings
msi-ipython-nb-ex
Example IPython notebooks for MSI
pq_parser
Script to parse text file downloads from ProQuest's Global Newsstream database into CSV of metadata and full text.
PredictingFireRisk
A #rspatial workshop on predicting fire risk in San Francisco
r-code
Mostly R code files for my posts on www.returnandrisk.com.
RapidFuzz
Rapid fuzzy string matching in Python using various string metrics
sigma_coding_youtube
This is a collection of all the code that can be found on my YouTube channel Sigma Coding.
text_similarity
Text Similarity