Leyang Xue's repositories
drunkcoding.github.io
Everything on distributed file systems and cloud storage
model-inference
Utilities and tests for model inference
alpa
Training and serving large-scale neural networks with auto parallelization.
cheetah-fastclick
FastClick with the Cheetah elements
CS411-Database-System
Project for database system -- an interactive website
DeeperSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
MIT-6.824-Distributed-System
Spring 2020
mixtral-offloading
Run Mixtral-8x7B models in Colab or consumer desktops
model-finetune
Fine-tune pre-trained models
onnxruntime_backend
The Triton backend for the ONNX Runtime.
power-meter
A software power measurement tool for both CPU and GPU using vendor-provided APIs
pytorch_backend
The Triton backend for the PyTorch TorchScript models.
simple-shell
Simple functioning shell implemented in C
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.