NYC COVID-19 Testing Wait Times PDF Scraper
Challenge accepted!
This repo automatically runs the scraper every 15 minutes on a cron schedule set up through GitHub Actions. Thanks to this wonderful blog post from Jason Etcovitch, which had most of the action automation setup I drew on for the workflow.
The CSV filenames are structured as {two-hour time window}-{scrape timestamp}.csv.
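As a rough illustration of that naming scheme, here is a minimal sketch in Python. The function name `build_csv_filename`, the way the time window is normalized, and the `%Y%m%dT%H%M%S` timestamp format are all assumptions made for the example; the scraper's real formats may differ.

```python
from datetime import datetime


def build_csv_filename(time_window: str, scraped_at: datetime) -> str:
    """Return a name shaped like '{two-hour time window}-{scrape timestamp}.csv'.

    Hypothetical helper: the exact rendering of both parts is an assumption.
    """
    # Assumed normalization: drop spaces and colons so the window is filename-safe.
    window_part = time_window.strip().replace(" ", "").replace(":", "")
    # Assumed timestamp format; the README does not specify one.
    stamp_part = scraped_at.strftime("%Y%m%dT%H%M%S")
    return f"{window_part}-{stamp_part}.csv"


example_name = build_csv_filename("4pm - 6pm", datetime(2020, 11, 30, 21, 7))
print(example_name)  # 4pm-6pm-20201130T210700.csv
```

A sortable timestamp format like the one assumed here keeps files for the same time window in chronological order when listed by name.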
Changelog
- 2020-11-30 21:07: Data now includes the time window as a column as well as the scrape time. The latest data is also stored in latest.csv. Corrected some data cleaning issues with characters being parsed incorrectly and newlines being included in the wait times column.
- 2020-11-30 21:14: Data moved to the data folder, copy of latest.csv included in root dir.
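
The layout described in the changelog could be produced along these lines. This is only a sketch under stated assumptions, not the scraper's actual code: the helper names `clean_wait_time` and `save_scrape`, the column names, and the choice to keep latest.csv in both the data folder and the root dir are all hypothetical.

```python
import csv
import shutil
from pathlib import Path


def clean_wait_time(raw: str) -> str:
    # Collapse newlines and repeated whitespace into single spaces.
    return " ".join(raw.split())


def save_scrape(rows, time_window, scrape_time, filename, root="."):
    """Write one scrape to data/<filename> and refresh the latest.csv copies.

    `rows` is an iterable of (location, wait_time) pairs. The column names
    below are assumptions for illustration only.
    """
    root = Path(root)
    data_dir = root / "data"
    data_dir.mkdir(parents=True, exist_ok=True)

    target = data_dir / filename
    with target.open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["location", "wait_time", "time_window", "scrape_time"])
        for location, wait_time in rows:
            writer.writerow(
                [location, clean_wait_time(wait_time), time_window, scrape_time]
            )

    # Assumed layout: latest.csv lives in data/ with a copy in the root dir.
    shutil.copyfile(target, data_dir / "latest.csv")
    shutil.copyfile(target, root / "latest.csv")
    return target
```

Calling `save_scrape([("Site A", "No Wait\nTime")], "4pm-6pm", "2020-11-30 21:14", "example.csv")` would write one cleaned row to data/example.csv and refresh both latest.csv copies.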