Chen Wang's repositories
autoscaler_predictor_model
The data synthesizer, forecaster and predictor model server used together with KEDA scaler or cluster autoscaler to achieve cluster autoscaling with diurnal pattern workload
OSSNA23Demo
The demo to benchmark energy consumption of FMaaS with GPU energy conservation.
Kepler-Demo
Manifests, Documents, Tools used in Kepler Demos
load-watcher
Load watcher is a cluster-wide aggregator of metrics, developed for Trimaran: Real Load Aware Scheduler in Kubernetes.
openheygen
HeyGen's open source solution
scheduler-plugins
Repository for out-of-tree scheduler plugins based on scheduler framework.
code-generator
Generators for kube-like API types
DALLE2-pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
dspy
DSPy: The framework for programming—not prompting—foundation models
embedchain
Framework to easily create LLM powered bots over any dataset.
kepler
Kepler (Kubernetes-based Efficient Power Level Exporter) uses eBPF to probe energy related system stats and exports as Prometheus metrics
kube-scheduler-simulator
A web-based simulator for the Kubernetes scheduler
kubernetes
Production-Grade Container Scheduling and Management
kubernetes-autoscaler-1
Autoscaling components for Kubernetes
langchain-ask-pdf
An AI-app that allows you to upload a PDF and ask questions about it. It uses OpenAI's LLMs to generate a response.
leaderboard
The leaderboard code for benchmarking LLM models.
magvit
Official JAX implementation of MAGVIT: Masked Generative Video Transformer
pushing-netperf-metrics-to-prometheus
Repository for the netperf component used by the Network-Aware framework for the Kubernetes platform based the Scheduler Framework
secondary-scheduler-operator
Red Hat Certified optional operator for secondary schedulers
Seine-HelloWorld
The testing repo for Seine Bot
sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
test-infra
Test infrastructure for the Kubernetes project.
trimaran-kubecon24
The demo scripts for Trimaran schedulers presented at KubeCon EU 2024 at Paris.
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
vllm-router
vLLM Router
wangchen615.github.io
Chen Wang's personal website
wg-env-sustainability
🌳🌍♻️ Environmental Sustainability Working Group