guillermo-carrasco / pyspark-walkthrough

A brief intro to PySpark

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PySpark walkthrough

This notebook walks you through the basics of PySpark and data manipulation with PySpark.

To run the notebook, install the requirements:

pip install -r requirements.txt

Python >= 3.5 is required, and I would recommend a clean environment to install it.

Have fun!

About

A brief intro to PySpark


Languages

Language:Jupyter Notebook 98.2%Language:Python 1.8%