Liang Tang's repositories
6.824-raft
A complete implementation of the Raft protocol from MIT 6.824
chaos-mesh
A Chaos Engineering Platform for Kubernetes.
dlrover
DLRover: An Automatic Distributed Deep Learning System
internal-acls
Repository used to maintain group ACLs used by Kubeflow developers
kineto
A CPU+GPU Profiling library that provides access to timeline traces and hardware performance counters.
koordinator
QoS-based scheduling system for hybrid orchestration workloads on Kubernetes, bringing workloads the best layout and status.
kube-batch
A batch scheduler for Kubernetes, targeting ML/BigData/HPC workloads
kubernetes
Production-Grade Container Scheduling and Management
mpi-operator
Kubernetes Operator for Allreduce-style Distributed Training
NPKit
NCCL Profiling Kit
p5.js-web-editor
In progress p5.js web editor, coming soon.
paddle-operator
Elastic Deep Learning Training based on Kubernetes by Leveraging EDL and Volcano
scheduler-plugins
Repository for out-of-tree scheduler plugins based on scheduler framework.
tf-operator
Tools for ML/Tensorflow on Kubernetes.