Sara Stoudt's starred repositories
QuorseNotes
Course notes and slides in the same file, facilitated by Quarto
when_police_replication
Replication and re-analysis repository for Nix et al. "When police pull back: Neighborhood-level effects of de-policing on violent and property crime, a research note." For more information contact Jacob Kang-Brown, jkangbrown@vera.org
a-physical-book
For National Novel Generation Month 2017
message-book
make a book from imessages
dh-lib-recipe
Basic code for Feeling Data and Affective Solidarity in the dh+lib issue on critical making and physical data
butterfly-engagement-reproducible
Reproducible code for estimating rates of over and underreporting in iNaturalist observations of butterflies
police_pursuits
A repository for data assembled by the San Francisco Chronicle for its 2024 investigation into fatal police pursuits across the U.S. Reporting and data analysis by Jennifer Gollan and Susie Neilson.
BioSCAN_2023
Code and data used to produce the results found in Lewthwaite et. al. 2023.
R_advent_calendaR
The original R advent calendaR that teaches the basics in R
plastics-prototype
Decision support tool for plastics policy.
poem-parser
Parsing poem texts in csvs for sampling
boating-accident-reports-1995-2012-muckrock
Records from the US Coast Guard’s database of recreational boating accident reports, obtained by MuckRock's Michael Morisy in 2013, converted here to CSV.
va-ssvf-survey-data
Data from the Department of Veteran Affairs (VA) Supportive Services for Veteran Families (SSVF) satisfaction surveys, obtained via FOIA.
inat-user-behavior
Analyses exploring spatial, temporal, taxonomic, and user biases in iNaturalist data
stringr-things
A dataset containing all dialogue from the Stranger Things series up to Season 4
the-seinfeld-chronicles
A dataset for textual analysis on arguably the best written comedy television show ever.
TheKingdomofNewbR
A fun, tiny little tutorial about traveling into the Kingdom of Rstats
fictional-time-with-GPT4
An experiment replicating part of "Why Literary Time is Measured in Minutes" with GPT-4.
StatsBehindTheHeadlines
Book: Statistics Behind the Headlines - code and data
data-ppf.github.io
website, lectures, and other material for Columbia University course "data: past, present, and future"
cinandofestivals
Code and data to replicate the analyses in the "Quantifying the global film festival circuit" paper