zhaoxiandong's repositories
LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
EfficientPyTorch
A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.
EagleEyeEFF
Implement channel pruning using the latest Torch.FX feature !!! && EagleEye reimplementation
PaperListTemplate
This template makes it easy for you to manage papers.
examples-run
A set of examples around pytorch in Vision with TRAINING BASH.
ASKs
Asks: Convolution with any-shape kernels for efficient neural networks (Neurocomputing.2021)
attention-is-all-you-need-paper
Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
awesome-image-transformer
List of all the papers on Transformers for Vision.
Dynamic-convolution-Pytorch
Pytorch!!!Pytorch!!!Pytorch!!! Dynamic Convolution: Attention over Convolution Kernels (CVPR-2020)
EagleEye
(ECCV'2020 Oral)EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning
litgpt
Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.
llama
Inference code for LLaMA models
LoRA
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
pytorch-cifar
95.47% on CIFAR10 with PyTorch
pytorch-cifar-models
Pretrained models on CIFAR10/100 in PyTorch
rocmstat
📊 A simple command-line utility for querying and monitoring GPU status
simplenote-android
Simplenote for Android
supermariopy
python library, scripts and notebooks that are usfull from time to time
triton
Development repository for the Triton language and compiler
tutorials
PyTorch tutorials.
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators