Tom Zeng's repositories
liblinear-ruby-swig
This is the Ruby interface to LIBLINEAR (much more efficient than LIBSVM for text classification and other large-scale linear classification tasks)
ACE_Azure_ML
This repository contains training material related to Azure and Machine Learning
amazon-sagemaker-examples
Example notebooks that show how to apply machine learning and deep learning in Amazon SageMaker
azure-docs
Open source documentation of Microsoft Azure
azure-sqldb-spark
This project provides a client library that allows Azure SQL DB or SQL Server to act as an input source or output sink for Spark jobs.
dr-elephant
Dr. Elephant is a job- and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark
drill-test-framework
Test Framework for Apache Drill
flink-sql-benchmark
TPC-DS benchmark tools for Flink batch SQL. Version 1.10 or above.
hdinsight-migration
HDInsight migration workshop content
incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dependency Scheduler; for Python, R, Julia, Scala, Go, JavaScript and more
kinesis-sql
Kinesis Connector for Structured Streaming
mxnet-model-server
Model Server for Apache MXNet is a tool for serving neural net models for inference
nexmark
Benchmarks for queries over continuous data streams.
s3contents
An S3-backed ContentsManager implementation for Jupyter
scio
A Scala API for Apache Beam and Google Cloud Dataflow.
spark-terasort
Spark Terasort
streaming-at-scale
How to implement a streaming-at-scale solution in Azure
TensorFlowOnSpark
TensorFlowOnSpark brings TensorFlow programs onto Apache Spark clusters