Youhe Jiang's repositories
IJCAI2023-OptimalShardedDataParallel
[IJCAI2023] An automated parallel training system that combines the advantages of both data and model parallelism. If you are interested, please visit, star, or fork https://github.com/Youhe-Jiang/OptimalShardedDataParallel
apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
AutoShard_Tool
A tool for recursive auto-sharding
ColossalAI
Colossal-AI: A Unified Deep Learning System for Big Model Era
ColossalAI-Examples
Examples of training models with hybrid parallelism using ColossalAI
DeepLabV3Plus-Pytorch
Pretrained DeepLabv3 and DeepLabv3+ for Pascal VOC & Cityscapes
DeepRL_PyTorch
Deep reinforcement learning code for study. Currently covers only the following algorithms: DQN, C51, QR-DQN, IQN, and QUOTA.
DeepSpeedExamples
Example models using DeepSpeed
examples
A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
fast-style-transfer
TensorFlow CNN for fast style transfer
FasterTransformer
Transformer-related optimization, including BERT and GPT
FlexFlow
A distributed deep learning framework that supports flexible parallelization strategies.
horovod
Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
llama
Inference code for LLaMA models
Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
msccl-tools
Synthesizer for optimal collective communication algorithms
nccl-tests
NCCL Tests
PipeSwitch
PipeSwitch: Fast Pipelined Context Switching for Deep Learning Applications
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
pytorch-CycleGAN-and-pix2pix
Image-to-Image Translation in PyTorch
Pytorch-UNet
PyTorch implementation of the U-Net for image semantic segmentation with high quality images
transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
yolov7
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
yolov7_d2
🔥🔥🔥🔥 (An earlier YOLOv7, not the official one) YOLO with Transformers and Instance Segmentation, with TensorRT acceleration! 🔥🔥🔥
Youhe-Jiang
Config files for my GitHub profile.