Keshi_Ge's repositories

Language:PythonStargazers:1Issues:2Issues:0

Sketch_Pytorch

Communication efficient Pytorch with Sketch

Language:PythonStargazers:1Issues:2Issues:0

albert_pytorch

A Lite Bert For Self-Supervised Learning Language Representations

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

CASQ

Cluster-Aware Sketch Quantization

Language:PythonStargazers:0Issues:2Issues:0

Count-Sketch-Optimizers

A compressed adaptive optimizer for training large-scale deep learning models using PyTorch

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

csh

Simple Hierarchical Count Sketch in Python

Language:PythonStargazers:0Issues:1Issues:0

darts

Differentiable architecture search for convolutional and recurrent networks

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DistributedTest

Some benchmark of distributed training

Language:PythonStargazers:0Issues:0Issues:0

fairseq

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0
Language:TeXLicense:GPL-3.0Stargazers:0Issues:1Issues:0

GeKeShi.github.io

Keshi's blog

Stargazers:0Issues:0Issues:0

GOPipe

GNN-Oriented Pipeline

Language:PythonStargazers:0Issues:2Issues:0

learning-to-quantize

Code for "Adaptive Gradient Quantization for Data-Parallel SGD", published in NeurIPS 2020.

License:Apache-2.0Stargazers:0Issues:0Issues:0

MachineLearning

Machine Learning in Action(机器学习实战)

Language:HTMLLicense:GPL-3.0Stargazers:0Issues:2Issues:0

Megatron-LM

Ongoing research training transformer models at scale

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

NAO

Neural Architecture Optimization

Language:PythonLicense:GPL-3.0Stargazers:0Issues:2Issues:0

nccl

Optimized primitives for collective multi-GPU communication

Language:C++License:NOASSERTIONStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

pixyll

A simple, beautiful Jekyll theme that's mobile first

Language:CSSLicense:MITStargazers:0Issues:2Issues:0

powergossip

Code for "Practical Low-Rank Communication Compression in Decentralized Deep Learning"

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

powersgd

Practical low-rank gradient compression for distributed optimization: https://arxiv.org/abs/1905.13727

License:MITStargazers:0Issues:0Issues:0

python-machine-learning-book

The "Python Machine Learning (1st edition)" book code repository and info resource

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

pytorch-distributed

A quickstart and benchmark for pytorch distributed training.

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

PyTorch_GBW_LM

PyTorch Language Model for 1-Billion Word (LM1B / GBW) Dataset

License:Apache-2.0Stargazers:0Issues:0Issues:0

sketchedsgd

Sketched SGD

Language:PythonLicense:GPL-3.0Stargazers:0Issues:1Issues:0
Language:PythonStargazers:0Issues:2Issues:0

TensorFlow-Tutorials

Simple tutorials using Google's TensorFlow Framework

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

tensorflow_multigpu_imagenet

Tensorflow code for training different architectures(DenseNet, ResNet, AlexNet, GoogLeNet, VGG, NiN) on ImageNet dataset + Multi-GPU support + Transfer Learning support

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

training-bottleneck

Analyze network performance in distributed training

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

tutorials

机器学习相关教程

Language:PythonLicense:MITStargazers:0Issues:2Issues:0