A scraper that pulls COVID-19 Coronavirus data scraped from government and curated data sources.
This project exists to scrape, de-duplicate, and cross-check county-level data on the COVID-19 coronavirus pandemic.
Every piece of data produced includes the URL where the data was sourced from as well as a rating of the source's technical quality (completeness, machine readability, best practices -- not accuracy).
https://coronadatascraper.com/
We upload fresh data every day at around 9PM PST.
Check out our Getting Started guide to help get our project running on your local machine.
You can contribute to this project in two big ways:
Check the Issues for any task we need to get done. If you are new to open source, look for the label Good first issue
Contributions for any place in the world are welcome. See the community-curated list of verified data sources to find a new datasource to add, and be sure to update the "Scraped?" column when you do.
To help you contribute a new source, please read the Sources and Scrapers guide before you start!
Send a pull request with your scraper, and be sure to run the scraper first with the instructions specified in the guide to make sure the data is valid.
This project is licensed under the permissive BSD 2-clause license.
The data produced by this project is public domain.
Please cite this project if you use it in your visualization or reporting.