danicat / spark101

Source code and support material for my talk Apache Spark 101

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Apache Spark 101

This repo contains the source code and support materials for my talk Apache Spark 101.

Setup

In order to run the example you must first install pyspark:

python3 -m venv venv
source venv/bin/activate
pip install pyspark

The setup.sh file is also provided for convenience.

Talks

  1. CapiConf 2020
  2. DevFest 2020
  3. TDC Recife 2020

Contact

If you have any questions, please reach out to me at daniela.petruzalek@gmail.com. I'm also on Twitter as @danicat83.

About

Source code and support material for my talk Apache Spark 101

License:GNU General Public License v3.0


Languages

Language:Jupyter Notebook 99.8%Language:Python 0.2%