Big data takes many forms. Sometimes it’s large CSV files on your computer. Sometimes it’s data in an external database. Sometimes it’s data in a corporate data lake. Here, we will look at how to use the power of dplyr and other R packages to work with big data in various formats to arrive at meaningful insight using a familiar and consistent set of tools.
This repository contains a collection of slides and example code used for talks and workshops given about working with Big Data in R.
This repository uses the
renv
package for
dependency management. The necessary R dependencies can be installed by running
renv::restore()
.