Lijie Xu's starred repositories
TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
awesome-streaming
A curated list of awesome streaming frameworks, applications, etc.
spark-ml-source-analysis
An analysis of the principles behind Spark ML algorithms and a walkthrough of their source-code implementations
benchm-ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
streaming-benchmarks
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
spark-knowledgebase
Spark Knowledge Base
grafana-spark-dashboards
Scripts for generating Grafana dashboards for monitoring Spark jobs
PostgresPrefs
Preference Pane for administering PostgreSQL on macOS
gelly-streaming
An experimental Graph Streaming API for Apache Flink
yahoo-streaming-benchmark
An extension of Yahoo's Benchmarks
tpch-spark
TPC-H queries in Apache Spark SQL using native DataFrames API
spark-ml-inventory
A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.
flink-perf
Flink performance tests
aliyun-spark-deploy-tool
Spark on ECS
mr-benchmarks
Modified benchmark from http://database.cs.brown.edu/projects/mapreduce-vs-dbms/
synthesize
Easy installer for Graphite and StatsD