essoen / TDT4305

Code and report for our project in TDT4305 BIg Data Architecture at NTNU spring 2016

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TDT4305 Big Data Architecture

Setup

  1. Install and setup Python 2.7, Spark and Pyspark.
  2. Unpack the ZIP-files with the data.

Run

  1. spark-submit task1/foursquare.py input_data_path
  2. spark-submit task2/twitter.py input_data_file output_data_file.

About

Code and report for our project in TDT4305 BIg Data Architecture at NTNU spring 2016

License:MIT License


Languages

Language:Python 51.6%Language:TeX 48.4%