Shiqing Fan (fanshiqing)

fanshiqing

Geek Repo

Company:NVIDIA

Location:Hangzhou, Zhejiang

Home Page:https://fanshiqing.github.io/

Github PK Tool:Github PK Tool

Shiqing Fan's repositories

grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Language:CudaLicense:Apache-2.0Stargazers:45Issues:0Issues:0

moe_grouped_gemm

A PyTorch Toolbox for Grouped GEMM in MoE Model Training

License:Apache-2.0Stargazers:3Issues:0Issues:0

MyLeetcodeSolutions

My leetcode/lintcode solutions in JAVA.

Language:JavaStargazers:1Issues:0Issues:0

Qix

Machine Learning、Deep Learning、PostgreSQL、Distributed System、Node.Js、Golang

License:NOASSERTIONStargazers:1Issues:2Issues:0

tensorflow-1

An Open Source Machine Learning Framework for Everyone

Language:C++License:Apache-2.0Stargazers:1Issues:1Issues:0

awesome-courses

:books: List of awesome university courses for learning Computer Science!

Stargazers:0Issues:0Issues:0

DAPPLE

An Efficiency Pipelined Data Parallel Approach for Large Models Training

Language:PythonStargazers:0Issues:1Issues:0
Language:HTMLStargazers:0Issues:0Issues:0

Megatron-LM

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

tensorflow

Computation using data flow graphs for scalable machine learning

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

benchmarks

Benchmark code

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

Best-websites-a-programmer-should-visit-zh

程序员应该访问的最佳网站中文版

License:MITStargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

License:MITStargazers:0Issues:0Issues:0

gloo

Collective communications library with various primitives for multi-machine training.

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

GPT2

An implementation of training for GPT2, supports TPUs

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

gradient-checkpointing

Make huge neural nets fit in memory

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

hindsight_experience_replay

A tensorflow implementation of hindsight experience replay

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

lingvo

Lingvo

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

mesh

Mesh TensorFlow: Model Parallelism Made Easier

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0

nccl-examples

NCCL Examples from Official NVIDIA NCCL Developer Guide.

Language:CMakeStargazers:0Issues:2Issues:0

nccl-tests

NCCL Tests

Language:CudaLicense:BSD-3-ClauseStargazers:0Issues:2Issues:0

nmt

TensorFlow Neural Machine Translation Tutorial

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0
License:MITStargazers:0Issues:0Issues:0

post--momentum

Why Momentum Really Works

Language:JavaScriptStargazers:0Issues:0Issues:0

rainbow-is-all-you-need

Rainbow is all you need! A step-by-step tutorial from DQN to Rainbow

License:MITStargazers:0Issues:0Issues:0

SGDLibrary

Matlab library for stochastic gradient descent algorithms: Version 1.0.12

Language:TerraLicense:MITStargazers:0Issues:0Issues:0

simplified-deeplearning

Simplified implementations of deep learning related works

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

tensorflow-internals

It is open source ebook about TensorFlow kernel and implementation mechanism.

Language:TeXStargazers:0Issues:2Issues:0

YellowFin_Pytorch

auto-tuning momentum SGD optimizer

Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0