Nan Zhu's repositories
benchm-ml
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning algorithms for binary classification (random forests, gradient boosted trees, deep neural networks etc.).
benchmarkingMapDB
a benchmark program for evaluating MapDB performance
CustomerTests
example code to show
dmlc.github.io
the homepage http://dmlc.ml
dr-elephant
Performance monitoring and tuning tool for Apache Hadoop
eventhubs-client
A generic Java client for Microsoft Azure EventHubs
LearningHaskell
personal repo for learning Haskell
McGill-COMP535-Fall-2015
COMP 535: Computer Network
peloton
The Self-Driving Database Management System
perf-map-agent
A java agent to generate method mappings to use with the linux `perf` tool
script-actions
script actions in powershell and bash to install/update new components on HDInsight clusters
spark-streaming-data-persistence-examples
Examples showing how streaming events can be persisted to Azure blob, Hive table and Azure SQL Table through Spark.
streaming-benchmarks
Benchmarks for Low Latency (Streaming) solutions including Apache Storm, Apache Spark, Apache Flink, ...
tensorframes
Tensorflow wrapper for DataFrames on Apache Spark
test_timestamp
test_timestamp
typescript_learn
start learning typescript
xgboost_test
integration test for xgboost