BuzzFeedNews / 2020-05-covid-city-zip-codes

Data, code, and methodology supporting the BuzzFeed News' analysis of COVID-19 ZIP codes and demographic trends.

Home Page:https://www.buzzfeednews.com/article/johntemplon/detailed-coronavirus-maps-by-zip-code-major-cities

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

US City COVID-19 ZIP Code Analysis

This repository contains data and code supporting a BuzzFeed News article about city-level ZIP code demographics and COVID-19 cases, published May 7, 2020. See below for details.

Data

ZIP code–level data

The analysis uses ZIP–code level case counts (as of May 4, 2020 for each city except for Detroit, which is as of May 7, 2020) for the following five cities, stored in the data/raw directory:

City ZIP shapefiles

The data/raw directory also includes ZIP-code level shapefiles for each of the five cities. Those geospatial files come from each city's open data portals and are used to filter for the appropriate ZIP codes and create the maps that are included in the article.

Census data

The demographic data used in the analysis comes from the 5-year ACS estimates for 2018 at the ZCTA level. The data file included in this repository has been pre-processed from seven different data files that are not included here, in order to reduce the size of the raw data in this repository.

CBSA and county data

The data/county-data directory contains several datasets relevant to the metro-area calculations described below. The datasets are:

Analysis

Analyze ZIP code–level COVID-19 case / demographic correlations

The city-demographic-factors-analysis.ipynb notebook loads the data for each city, calculates correlations between various demographic factors and per capita case counts (at a ZIP code level), and graphically explores some of those correlations. It also outputs the GeoJSON and CSV files used to create the maps and scatterplots in the story.

Calculate population, cases, and deaths in largest US metro areas

The calculate-metro-area-proportions.ipynb notebook uses the county-level data described above to calculate the proportion of population, COVID-19 cases, and COVID-19 deaths in the United States' 15 largest metro areas, relative to US totals.

Licensing

All code in this repository is available under the MIT License. Files in the output/ directory are available under the Creative Commons Attribution 4.0 International (CC BY 4.0) license.

Contact

If you have any questions about this repository you can reach out to John Templon at john.templon@buzzfeed.com.

Looking for more from BuzzFeed News? Click here for a list of our open-sourced projects, data, and code.

About

Data, code, and methodology supporting the BuzzFeed News' analysis of COVID-19 ZIP codes and demographic trends.

https://www.buzzfeednews.com/article/johntemplon/detailed-coronavirus-maps-by-zip-code-major-cities


Languages

Language:Jupyter Notebook 100.0%