tidycensus
tidycensus is an R package that allows users to interface with the US Census Bureau's decennial Census and five-year American Community APIs and return tidyverse-ready data frames, optionally with simple feature geometry included. Install from CRAN with the following command:
install.packages("tidycensus")
In version 0.9:
get_acs()
now defaults to the 2013-2017 five-year American Community Survey estimates, or the 2017 1-year estimates if users setsurvey = "acs1"
.- Various improvements and bug fixes.
In version 0.8.1:
-
tidycensus now includes support for the Census Bureau Population Estimates API. Please see https://walkerke.github.io/tidycensus/articles/other-datasets.html for examples of how to use these new features.
-
Important internal changes to ensure that tidycensus continues to work with upcoming changes to Census API endpoints.
-
Several internal improvements and bug fixes. Please see the closed issues at https://github.com/walkerke/tidycensus/issues for a list.
In version 0.4.6:
-
Bug fixed that was causing GEOIDs for some states to be converted to NA on certain Linux platforms
-
A new parameter,
shift_geo
, allows tidycensus users to get US state and county geometry originally obtained with the albersusa R package with Alaska and Hawaii shifted and re-scaled for better cartographic display of the entire US.
library(tidycensus)
library(tidyverse)
library(viridis)
us_county_income <- get_acs(geography = "county", variables = "B19013_001",
shift_geo = TRUE, geometry = TRUE)
ggplot(us_county_income) +
geom_sf(aes(fill = estimate), color = NA) +
coord_sf(datum = NA) +
theme_minimal() +
scale_fill_viridis_c()
Why tidycensus?
My work heavily involves the use of data from the US Census Bureau, and like many R users, I do most of my work within the tidyverse. Beyond this, the sf package now allows R users to work with spatial data in an integrated way with tidyverse tools, and updates to the tigris package provide access to Census boundary data as sf
objects. Recently, I've found myself writing the same routines over and over to get Census data ready for use with tidyverse packages and sf. This motivated me to wrap these functions in a package and open-source in case other R users find them useful.
tidycensus is designed to help R users get Census data that is pre-prepared for exploration within the tidyverse, and optionally spatially with sf. To learn more about how the package works, I encourage you to read the following articles:
Future development
To keep up with on-going development of tidycensus and get even more examples of how to use the package, subscribe to my email list by clicking here (no spam, I promise!). You'll also get updates on the development of my upcoming book with CRC Press, Analyzing the US Census with R.
You can also follow my blog at https://walkerke.github.io.
My development focus is on making the current datasets as accessible as possible; if you need other approaches or datasets, I recommend the censusapi and acs packages.
If you find this project useful, you can support package development in the following ways:
- Hiring me as a consultant to help you use tidycensus in your project, or hiring me to give a workshop on tidycensus for your organization. Please contact me at kwalkerdata@gmail.com if you are interested!
- Filing an issue - or even better, a pull request - at https://github.com/walkerke/tidycensus/issues.
Note: This product uses the Census Bureau Data API but is not endorsed or certified by the Census Bureau.