Shiqing Fan's repositories
grouped_gemm
PyTorch bindings for CUTLASS grouped GEMM.
moe_grouped_gemm
A PyTorch Toolbox for Grouped GEMM in MoE Model Training
MyLeetcodeSolutions
My leetcode/lintcode solutions in JAVA.
tensorflow-1
An Open Source Machine Learning Framework for Everyone
awesome-courses
:books: List of awesome university courses for learning Computer Science!
Megatron-LM
Ongoing research training transformer language models at scale, including: BERT & GPT-2
tensorflow
Computation using data flow graphs for scalable machine learning
benchmarks
Benchmark code
Best-websites-a-programmer-should-visit-zh
程序员应该访问的最佳网站中文版
DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.
gradient-checkpointing
Make huge neural nets fit in memory
hindsight_experience_replay
A tensorflow implementation of hindsight experience replay
nccl-examples
NCCL Examples from Official NVIDIA NCCL Developer Guide.
nccl-tests
NCCL Tests
post--momentum
Why Momentum Really Works
rainbow-is-all-you-need
Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow
SGDLibrary
Matlab library for stochastic gradient descent algorithms: Version 1.0.12
simplified-deeplearning
Simplified implementations of deep learning related works
tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
tensorflow-internals
It is open source ebook about TensorFlow kernel and implementation mechanism.
YellowFin_Pytorch
auto-tuning momentum SGD optimizer