Sandy Ryza's repositories
spark-timeseries
A library for time series analysis on Apache Spark
spark-ts-examples
Spark TS Examples
simplesparkapp
Simple Spark Application
awesome-bigdata
A curated list of awesome big data frameworks, ressources and other awesomeness.
covid-social-distancing
Social Distancing Metrics in the Age of COVID-19
scala-project-template
I start Scala projects by copying this
awesome-production-machine-learning
A curated list of awesome open source libraries to deploy, monitor, version and scale your machine learning
branchreduce
Distributed branch-and-bound on Hadoop YARN.
awesome-opensource-data-engineering
An Awesome List of Open-Source Data Engineering Projects
exercise-classes
Class attendance analysis
jaffle_shop
A self-contained dbt project for testing purposes
knowledge-repo
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
LSP-ruff
LSP helper for ruff - an extremely fast Python linter, written in Rust.
postgres-util
Utilities for moving data around with postgres