amalaj7 / Pyspark-Notes

This repository contains the Notes for Pyspark

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Prerequisite : For you to execute these notebooks, you have to setup the Pyspark in Local Mode or you can log in to Databricks Platform and execute in their environment.

You can refer the below blog to setup the local mode installation and to execute programs in jupyter notebook.

https://sigdelta.com/blog/how-to-install-pyspark-locally/

For Architecture and Documentation

Refer This Link For Detailed Architecture of Pyspark => SparkTutorial

You can also refer the official documentation of Spark(I'm using 3.0.2) => officalDoc

About

This repository contains the Notes for Pyspark


Languages

Language:Jupyter Notebook 100.0%