Practice exercises for Apache Spark with Python, starting from a word count program and ending with a movie recommender system running on Amazon EC2 clusters.

Data source: https://grouplens.org/datasets/movielens/

On a local system, use the smaller "MovieLens 100K Dataset". Once you start working on clusters, you can move up to the "MovieLens 1M Dataset" or the "MovieLens 10M Dataset".

You will need Python 2.x, a Python editor (PyCharm, Canopy, or Jupyter Notebook are recommended), the latest version of Apache Spark, and, optionally, access to Amazon AWS cloud services.
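
As a starting point, the word count exercise might look like the minimal sketch below. It is an illustration, not the code from this repository: the function names (tokenize, word_count_local, word_count_spark) and the path argument are hypothetical. The Spark version assumes you already have a SparkContext and uses only the standard RDD operations textFile, flatMap, map, reduceByKey, and collect. The plain-Python version is there so you can check the logic without a Spark installation.

```python
from __future__ import print_function  # keeps print() consistent on Python 2.x

import re
from collections import Counter


def tokenize(line):
    """Lowercase a line and split it into words (letters, digits, apostrophes)."""
    return re.findall(r"[a-z0-9']+", line.lower())


def word_count_local(lines):
    """Plain-Python reference: count words across an iterable of lines."""
    return Counter(word for line in lines for word in tokenize(line))


def word_count_spark(sc, path):
    """Same computation with RDD operations.

    Assumes `sc` is an existing pyspark SparkContext and `path` points to a
    text file that Spark can read (both supplied by the caller).
    """
    return (
        sc.textFile(path)                    # one RDD element per line
        .flatMap(tokenize)                   # one RDD element per word
        .map(lambda word: (word, 1))         # pair each word with a count of 1
        .reduceByKey(lambda a, b: a + b)     # sum the counts for each word
        .collect()                           # bring (word, count) pairs to the driver
    )


if __name__ == "__main__":
    # Runs without Spark: exercises only the local reference version.
    sample = ["Spark is fast", "spark is fun"]
    for word, count in sorted(word_count_local(sample).items()):
        print(word, count)
```

To try the Spark version, create a SparkContext in your own script (for example with a local master) and call word_count_spark(sc, "your_file.txt"), where the file name is a placeholder for any text file you have on hand.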