Wang Zhang's repositories
controller-runtime
Repo for the controller-runtime subproject of kubebuilder (sig-apimachinery)
crane
Crane is a FinOps Platform for Cloud Resource Analytics and Economics in Kubernetes clusters. The goal is not only help user to manage cloud cost easier but also ensure the quality of applications. https://gocrane.io/
DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
fairscale
PyTorch extensions for high performance and large scale training.
flownet2-pytorch
Pytorch implementation of FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
internal-acls
Repository used to main group ACLs used by Kubeflow developers
kubeflow-operator
temporary try for reconciler package in kubeflow/common
kubeflow-testing
Test infrastructure and tooling for Kubeflow.
ml-demo
Machine learning demo for skai community.
mpi-operator
Kubernetes Operator for Allreduce-style Distributed Training
p4app-switchML
Switch ML Application
sample-serving-controller
Repository for sample controller. Complements sample-apiserver
scheduler-plugins
Repository for out-of-tree scheduler plugins based on scheduler framework.
simple-kubernetes-webhook
This project is aimed at illustrating how to build a fully functioning kubernetes admission webhook in the simplest way possible.
tensorflow
An Open Source Machine Learning Framework for Everyone
tf-operator
Tools for ML/Tensorflow on Kubernetes.
torch-rpc-on-k8s
dynamically manage remote remote rpc objects in pytorch on k8s