Qizhen WENG's repositories
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
clusterdata
cluster data collected from production clusters in Alibaba for cluster management research
ColossalAI
Making large AI models cheaper, faster and more accessible
credentials-nodejs
Alibaba Cloud Credentials for TypeScript/Node.js
FastChat
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.
hkust-latex-thesis-template
A Better HKUST LaTeX Thesis Template
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
open-simulator
K8s cluster simulator for capacity planning
qzweng.github.io
My academic personal pages
skypilot
SkyPilot is a framework for easily running machine learning workloads on any cloud through a unified interface.
DeepPlan
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
HeliosArtifact
incubator-mxnet
Lightweight, portable, flexible distributed/mobile deep learning with a dynamic, mutation-aware dataflow dependency scheduler; for Python, R, Julia, Scala, Go, JavaScript, and more
k8s-device-plugin
The OpenAIOS vGPU device plugin for Kubernetes originated from the OpenAIOS project to virtualize GPU device memory, allowing applications to access more memory than the physical capacity. It is designed to make extended device memory easy to use for AI workloads.
k8s-vgpu-scheduler
The OpenAIOS vGPU scheduler for Kubernetes originated from the OpenAIOS project to virtualize GPU device memory.
obsidian-things
An Obsidian theme inspired by the beautifully-designed app, Things.
ps-lite
A lightweight parameter server interface
qzweng.github.io-202308
My academic personal pages
ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
seed_rl
SEED RL: Scalable and Efficient Deep-RL with Accelerated Central Inference. Implements IMPALA and R2D2 algorithms in TF2 with SEED's architecture.
typeset
Automatically fixes full-width/half-width punctuation, spacing, and similar typesetting issues in mixed Chinese, English, and code text
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
xtuner
An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)