licsh / pyspark-tutorial

PySpark-Tutorial provides basic algorithms using PySpark

Home Page:http://mapreduce4hackers.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PySpark Tutorial

  • PySpark is the Python API for Spark.
  • The purpose of PySpark tutorial is to provide basic distributed algorithms using PySpark.
  • PySpark has an interactive shell ($SPARK_HOME/bin/pyspark) for basic testing and debugging and is not supposed to be used for production environment.
  • You may use $SPARK_HOME/bin/spark-submit command for running PySpark programs (may be used for testing and production environemtns)

PySpark Examples and Tutorials

PySpark Tutorial and References...

Questions/Comments

Thank you!

best regards,
Mahmoud Parsian

PySpark Algorithms Book

Data Algorithms Book

About

PySpark-Tutorial provides basic algorithms using PySpark

http://mapreduce4hackers.com

License:Other


Languages

Language:Python 74.6%Language:Shell 25.4%