Chen Shangyu's repositories
Awesome-Deep-Neural-Network-Compression
Summary, Code for Deep Neural Network Quantization
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
pytorch-vgg-cifar10
This is the PyTorch implementation of VGG network trained on CIFAR10 dataset
Scarce-Data
Setting and Evaluation Protocol of Scarce Data
transformers
🤗 Transformers: State-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
Awesome-System-for-Machine-Learning
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
ComiRec
Source code and dataset for KDD 2020 paper "Controllable Multi-Interest Framework for Recommendation"
csyhhu.github.io
My blog
google-research
Google Research
learning-to-learn
Learning to Learn in TensorFlow
llama
Inference code for LLaMA models
mean-teacher
A state-of-the-art semi-supervised method for image recognition
Megatron-LM
Ongoing research training transformer models at scale
meta-dataset
A dataset of datasets for learning to learn from few examples
mixup-cifar10
mixup: Beyond Empirical Risk Minimization
pytorch-cifar
95.16% on CIFAR10 with PyTorch
PyTorch-GAN
PyTorch implementations of Generative Adversarial Networks.
pytorch-mobilenet-v2
A PyTorch implementation of MobileNet V2 architecture and pretrained model.
pytorch-tensor-decompositions
PyTorch implementation of [1412.6553] and [1511.06530] tensor decomposition methods for convolutional layers.
sparsegpt
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".
stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.