TDT4305 Big Data Architecture Setup Install Spark and Hadoop. Unpack the ZIP-folder with the data. Run spark-submit main.py <input_data_path> <post_id>