Spark Course for Uppsala University April 21, 2015 Ake Edlund and Izhar ul Hassan
Material based on AMPCamp and Databricks training material provided online under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.
*** First time INSTRUCTIONS - From scratch, first time on a spark cluster
cd notebooks
git clone -b spark_course_uu https://github.com/databahn/spark_course.git
cd spark_course
./loaddata.sh
*** Check that it's there:
hdfs dfs -ls /uuData