zanussbaum / nyc-test-tracker

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

NYC COVID-19 Testing Wait Times PDF Scraper

request to create a scraper for nyc testing times

Challenge accepted!

This repo will automatically run the scraper every 15 minutes with a cronjob set up through GitHub Actions. Thanks to this wonderful blog post from Jason Etcovitch that had most of the action automation setup I pulled from for the workflow.

The csv filenames are structured as {two-hour time window}-{scrape timestamp}.csv.

Changelog

  • 2020-11-30 21:07: Data now includes the time window as a column as well as the scrape time. The latest data is also stored in latest.csv. Corrected some data cleaning issues with characters being parsed incorrectly and newlines being included in the wait times column.
  • 2020-11-30 21:14: Data moved to the data folder, copy of latest.csv included in root dir.

About


Languages

Language:Python 100.0%