something to help you spark
This is a library of reusable code for Spark applications, factored out of applications we've built at Red Hat. It will grow in the future but for now we have an application skeleton, a simple natural join for data frames, and a histogramming RDD.
Add the following resolver to your project:
resolvers += "Will's bintray" at "https://dl.bintray.com/willb/maven/"
and then add Silex as a dependency:
libraryDependencies += "com.redhat.et" %% "silex" % "0.0.8"
The Silex web site includes some examples of Silex functionality in use and API docs.