Keshi_Ge's repositories
Sketch_Pytorch
Communication-efficient PyTorch training with sketching
albert_pytorch
A Lite BERT for Self-Supervised Learning of Language Representations
Count-Sketch-Optimizers
A compressed adaptive optimizer for training large-scale deep learning models using PyTorch
darts
Differentiable architecture search for convolutional and recurrent networks
DistributedTest
Benchmarks for distributed training
GeKeShi.github.io
Keshi's blog
learning-to-quantize
Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.
MachineLearning
Machine Learning in Action(机器学习实战)
Megatron-LM
Ongoing research training transformer models at scale
powergossip
Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning"
powersgd
Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727
python-machine-learning-book
The "Python Machine Learning (1st edition)" book code repository and info resource
pytorch-distributed
A quickstart guide and benchmark for PyTorch distributed training.
PyTorch_GBW_LM
PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset
sketchedsgd
Sketched SGD
TensorFlow-Tutorials
Simple tutorials using Google's TensorFlow framework
tensorflow_multigpu_imagenet
TensorFlow code for training different architectures (DenseNet, ResNet, AlexNet, GoogLeNet, VGG, NiN) on the ImageNet dataset, with multi-GPU and transfer learning support
training-bottleneck
Analyzing network performance bottlenecks in distributed training