Rui Wang's repositories
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
DeepLearningExamples
Deep Learning Examples
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
dlrover
DLRover: An Automatic Distributed Deep Learning System
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
PERSIA
High performance distributed framework for training deep learning recommendation models based on PyTorch.
pytorch-lightning
The lightweight PyTorch wrapper for high-performance AI research. Scale your models, not the boilerplate.
gpu-burn
Multi-GPU CUDA stress test
llm-analysis
Latency and Memory Analysis of Transformer Models for Training and Inference
tensorboard
TensorFlow's Visualization Toolkit
tensorflow
An Open Source Machine Learning Framework for Everyone