Han Ju's repositories
go-mapreduce
Simple map-reduce implementation in Golang
go-replicated-log
Simple paxos-based replicated log implementation for learning purpose
powerline-shell-scala
A re-implementation of powerline-shell in Scala
akka-pagerank
Experimental page rank implementation in Akka
scala-exercises
Some interesting programs written in Scala
spark-dataflow
Provides a Spark backend for executing Dataflow pipelines.
spark-test
Some hands-on tests with Spark
SparkInternals
Notes talking about the design and implementation of Apache Spark
1brc
1️⃣🐝🏎️ The One Billion Row Challenge -- A fun exploration of how quickly 1B rows from a text file can be aggregated with Java
cluster-pack
A library on top of either pex or conda-pack to make your Python code easily available on a cluster
mini-lsm
A tutorial of building an LSM-Tree storage engine in a week!
pathy-dict
Python dictionary manipulations with paths
plotly-scala
Scala bindings for plotly.js
practical-reactor
Practical Project Reactor and reactive programing workshop
scalaz-task-intro
Introduction to Task, November 2014
simpleflow
Python library for dataflow programming.
spark-jobserver
REST job server for Spark