Shigang Li (Shigangli)

Shigangli

Geek Repo

Company:Beijing University of Posts and Telecommunications

Location:Beijing, China

Home Page:https://shigangli.github.io/

Twitter:@shigang_li

Github PK Tool:Github PK Tool

Shigang Li's repositories

Magicube

Magicube is a high-performance library for quantized sparse matrix operations (SpMM and SDDMM) of deep learning on Tensor Cores.

Language:C++License:GPL-3.0Stargazers:81Issues:4Issues:2

Chimera

Chimera: Efficiently Training Large-Scale Neural Networks with Bidirectional Pipelines.

Language:PythonLicense:GPL-3.0Stargazers:44Issues:2Issues:3

Ok-Topk

Ok-Topk is a scheme for distributed training with sparse gradients. Ok-Topk integrates a novel sparse allreduce algorithm (less than 6k communication volume which is asymptotically optimal) with the decentralized parallel Stochastic Gradient Descent (SGD) optimizer, and its convergence is proved theoretically and empirically.

Language:PythonLicense:GPL-3.0Stargazers:23Issues:2Issues:4

eager-SGD

Eager-SGD is a decentralized asynchronous SGD. It utilizes novel partial collectives operations to accumulate the gradients across all the processes.

Language:PythonLicense:Apache-2.0Stargazers:8Issues:3Issues:0

COMPI

Cache-oblivious MPI all-to-all communications based on Morton order

Language:CLicense:GPL-3.0Stargazers:3Issues:2Issues:0

DNN-cpp-proxies

C++/MPI proxies for distributed training of deep neural networks.

Language:C++License:GPL-3.0Stargazers:1Issues:3Issues:0

akg

AKG (Auto Kernel Generator) is an optimizer for operators in Deep Learning Networks, which provides the ability to automatically fuse ops with specific patterns.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

bigbird-1

Google's BigBird (Jax/Flax & PyTorch) @ 🤗Transformers

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

brian2

Brian is a free, open source simulator for spiking neural networks.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

ColossalAI

Making large AI models cheaper, faster and more accessible

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CuAssembler

An unofficial cuda assembler, for all generations of SASS, hopefully :)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dace

DaCe - Data Centric Parallel Programming

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

dlrm

An implementation of a deep learning recommendation model (DLRM)

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

dlrover

DLRover: An Automatic Distributed Deep Learning System

License:NOASSERTIONStargazers:0Issues:0Issues:0

examples

A set of examples around pytorch in Vision, Text, Reinforcement Learning, etc.

Language:PythonLicense:BSD-3-ClauseStargazers:0Issues:1Issues:0

flax

Flax is a neural network library for JAX that is designed for flexibility.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

legion

The Legion Parallel Programming System

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

longformer

Longformer: The Long-Document Transformer

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

mindspore

MindSpore is a new open source deep learning training/inference framework that could be used for mobile, edge and cloud scenarios.

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:Jupyter NotebookLicense:MITStargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:1Issues:0

oneDNN

oneAPI Deep Neural Network Library (oneDNN)

Language:C++License:Apache-2.0Stargazers:0Issues:1Issues:0

p4app-switchML

Switch ML Application

Language:C++License:Apache-2.0Stargazers:0Issues:0Issues:0

ROCm

ROCm - Open Source Platform for HPC and Ultrascale GPU Computing

Stargazers:0Issues:1Issues:0

shigangli.github.io

Homepage of Shigang Li https://shigangli.github.io/

Language:HTMLStargazers:0Issues:2Issues:0

sparsegpt

Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:C++Stargazers:0Issues:1Issues:0
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

XiangShan

Open-source high-performance RISC-V processor

Language:ScalaLicense:NOASSERTIONStargazers:0Issues:1Issues:0

yapf

A formatter for Python files

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0