samthebest's repositories
aggregations
Fast accurate low memory type-safe aggregations DSL for Spark
agile-data-science-manifesto
Agile Manifesto for Data Science
basic-scala-project.g8
A giter8 template for generating a new Scala project.
lfs-warning
GitHub Action to detect Large Files in a Pull Request
Language:TypeScriptMIT000
omniture-data-tools
A set of tools for working with Omniture daily data files (hit_data.tsv) in big or small tools like Spark, Hadoop or just Python.
Language:JavaMIT000
Language:TeX000
Language:Scala000
Language:Shell000
sbt-release
A release plugin for sbt (>= 0.11.0)
scala-best-practices
A collection of Scala best practices
000
scala-build-files
A collection of templates / boiler plates build / project files / structures for quickly getting a Scala project up and running with various libraries
Language:Jupyter Notebook000
Language:Scala000
scalafmt
Code formatter for Scala
Apache-2.0000
spark-cloud
Spark-cloud is a set of scripts for starting spark clusters on ec2
Language:Shell000
table-tennis-rating-app
Genius!
Language:Scala000
Language:Scala000
Language:Scala000
zeppelin
Mirror of Apache Zeppelin
Language:JavaApache-2.0000