rhasanm / being-data-engineer

Home Page:https://rhasanm.github.io/being-data-engineer/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

being-data-engineer

Languages

sdk install scala 2.12.18 # for spark 3.5.0

Tools

cd tools
wget https://dlcdn.apache.org/spark/spark-3.5.0/spark-3.5.0-bin-hadoop3.tgz
tar xzvf spark-3.5.0-bin-hadoop3.tgz
cd spark-3.5.0-bin-hadoop3
echo 'export PATH=$PATH:'"$(pwd)"/bin >> ~/.zshrc
echo 'export SPARK_SHELL='"$(pwd)" >> ~/.zshrc

About

https://rhasanm.github.io/being-data-engineer/


Languages

Language:Jupyter Notebook 80.4%Language:Python 18.3%Language:Makefile 1.2%