Analyzing GitHub Data with Spark Downloading data from GitHub Archive wget -P data/ http://data.githubarchive.org/2015-01-01-{0..23}.json.gz Run sbt run