Zhao Hongyao's repositories
airflow-backfill-util
UI-based Airflow backfill plugin for existing or new Airflow environments
akka-cluster-on-kubernetes
Sample project for deploying Akka Cluster to Kubernetes. Presented at Scala Up North on July 21, 2017.
docker-java
Java Docker API Client
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
awesome-ml-for-cybersecurity
Machine Learning for Cyber Security
ballista
Distributed compute platform implemented in Rust, using the Apache Arrow memory model.
conduit
The ultralight service mesh for Kubernetes
delta
An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.
fastText_java
Java port of the C++ version of Facebook's fastText
incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dependency Scheduler; for Python, R, Julia, Scala, Go, JavaScript and more
JavaGuide
[Java Study + Interview Guide] A guide covering the core knowledge that most Java programmers need to master.
leeml-notes
Notes on Hung-yi Lee's "Machine Learning" course; read online at https://datawhalechina.github.io/leeml-notes
mleap
MLeap: Deploy Spark Pipelines to Production
parquet-mr
Mirror of Apache Parquet
petastorm
The Petastorm library enables single-machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as TensorFlow, PyTorch, and PySpark and can be used from pure Python code.
pipeline
PipelineIO: End-to-End ML and AI Platform for Real-time Spark and TensorFlow Data Pipelines
pyspell
Python log parser using "Spell: Streaming Parsing of System Event Logs"
scheduler
A Scala library for scheduling arbitrary code to run at an arbitrary time.
seldon-server
Enterprise machine learning platform for prediction and recommendation.
tensorflow
Computation using data flow graphs for scalable machine learning
tutorials
The "REST With Spring" Course:
vearch
A distributed system for embedding-based vector retrieval