piyush1711 / pySpark_tutorial

Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

pySpark_tutorial

List of contents

  • RDDs and DataFrame
  • Exploratory data analysis
  • Handeling multiple dataframes
  • Visualization
  • Machine learning

About

Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning


Languages

Language:Jupyter Notebook 100.0%