recommender

Based on an example presented in "Apache Spark 2.0 with Scala - Hands On with Big Data!"

Instructions to run recommender on AWS EMR.

Leave the SparkContext unconfigured. We want to use EMR's defaults.

Use sbt-assembly. First, add it to the file project/plugins.sbt:

addSbtPlugin("com.eed3si9n" % "sbt-assembly" % "0.14.5")

Remember to specify that spark is provided. In build.sbt:

"org.apache.spark" %% "spark-core" % "2.2.0" % "provided"

Run the command sbt assembly. The fat jar should be in target/scala-2.11.

Upload data and fat jar to s3 bucket so that they can be retrieved from the cluster.

remember to change security to accept ssh connections from master
ssh to master node
download fat jar and names file

aws s3 cp s3://lum-ai-bucket/ml-1m/recommender-assembly-1.0.jar . aws s3 cp s3://lum-ai-bucket/ml-1m/movies.dat .

spark-submit recommender-assembly-1.0.jar