March 2022
This project provides a synopsis of a company’s big data pipeline, investigates data in a sample database, and summarizes NFL data using the pandas API on Spark.
Citations for part 1 have been added using jupyterlab-citation-manager.
To export these notebooks as HTML with embedded images, install jupyter_contrib_nbextensions and jupyter_nbextensions_configurator by running:
$ pip install jupyter_contrib_nbextensions
$ pip install jupyter_nbextensions_configurator
$ jupyter contrib nbextension install --user
$ jupyter nbextensions_configurator enable --user
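Once the extensions are installed, the export itself can be run from the command line. The following is a minimal sketch, assuming the `html_embed` exporter that jupyter_contrib_nbextensions registers with nbconvert is available in your environment. The notebook name `analysis.ipynb` is a hypothetical placeholder, not a file from this project.

```shell
# Hypothetical notebook name (no spaces); replace with a notebook from this project.
notebook="analysis.ipynb"

# html_embed is the exporter provided by jupyter_contrib_nbextensions;
# it writes an HTML file with images embedded as base64 data.
export_cmd="jupyter nbconvert --to html_embed $notebook"

# Run the export only if jupyter is on PATH and the notebook exists;
# otherwise just print the command that would be run.
if command -v jupyter >/dev/null 2>&1 && [ -f "$notebook" ]; then
    $export_cmd
else
    echo "Would run: $export_cmd"
fi
```

If the exporter is not found, the installed nbconvert version may not be compatible with jupyter_contrib_nbextensions. In that case, check the output of `jupyter nbconvert --help` for the list of available export formats.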