Frank Bertsch's repositories
queryparser
Parsing and analysis of Vertica, Hive, and Presto SQL.
emr-bootstrap-spark
AWS bootstrap scripts for Mozilla's flavoured Spark setup.
lua_sandbox_extensions
Extension packages (sandboxes and modules) for the lua_sandbox project
zeppelin
Mirror of Apache Zeppelin
ScalaPB
Protocol buffer compiler for Scala.
akka
Build highly concurrent, distributed, and resilient message-driven applications on the JVM
spark-hyperloglog
Algebird's HyperLogLog support for Apache Spark.
examples
Kubernetes application example tutorials
main_summary_check
Checks for the output of main_summary
experiments-viewer
Web API and front end dashboard UI for multi-variant experiment results
mozilla-reports
Repository for public analyses.
beautiful-jekyll
Build a beautiful and simple website in literally minutes. Demo at http://deanattali.com/beautiful-jekyll
redash
This is a Mozilla fork of the re:dash project (https://redash.io/), where we do work to be contributed back to the upstream project and for our own custom needs.
data-pipeline
Mozilla Services Data Pipeline
spark
Mirror of Apache Spark
mozping_explorer
Some simple functions to make exploring pings a bit easier
parquet2hive_server
A server and client that can remotely import parquet data into Hive
telemetry-analysis-service
Telemetry Analysis Service
presto
Distributed SQL query engine for big data
parquet-mr
As we have moved to Apache, please open your pull requests on: https://github.com/apache/parquet-mr
EventMap
Fork of EventMap for Indivisible Iowa
telemetry-docs
Firefox User Data Documentation
martingale-change-detector
A martingale approach to detect changes in Telemetry histograms
smart_open
Utils for streaming large files (S3, HDFS, gzip, bz2...)
moz-aws-cli
Some simple commands for utilizing our spark infrastructure