hustzxd

zhaoxiandong's repositories

LSQuantization

An unofficial PyTorch implementation of Learned Step Size Quantization (LSQ), ICLR 2020

Language: Jupyter Notebook · 123 5 9

EfficientPyTorch

A PyTorch Framework for Efficient Pruning and Quantization for specialized accelerators.

Language: Jupyter Notebook · 31 2 5

EagleEyeEFF

Channel pruning implemented with the torch.fx feature, plus an EagleEye reimplementation

Language: Python · 6 10

blog

My blog backup

Language: Stylus · 3 10

PaperListTemplate

This template makes it easy for you to manage papers.

Language: Python · 2 20

examples-run

A set of examples around PyTorch in vision, with training Bash scripts.

Language: Python · License: BSD-3-Clause · 1 20

ABCPruner

PyTorch implementation of our IJCAI 2020 paper, "Channel Pruning via Automatic Structure Search"

Language: Python · 010

aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Language: Python · License: NOASSERTION · 010

ASKs

ASKs: Convolution with any-shape kernels for efficient neural networks (Neurocomputing, 2021)

Language: Python · 000

attention-is-all-you-need-paper

Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.

Language: Jupyter Notebook · License: MIT · 000

Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

000

awesome-image-transformer

List of all the papers on Transformers for Vision.

License: Apache-2.0 · 010

BitSplit

BitSplit Post-training Quantization

Language: Python · License: Apache-2.0 · 010

Dynamic-convolution-Pytorch

PyTorch implementation of Dynamic Convolution: Attention over Convolution Kernels (CVPR 2020)

Language: Python · 010

dynamic-pruning

Language: Python · 010

EagleEye

(ECCV 2020 Oral) EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning

Language: Python · 000

hustzxd

020

ictlogin

Language: Python · 000

litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

Language: Python · License: Apache-2.0 · 000

llama

Inference code for LLaMA models

Language: Python · License: GPL-3.0 · 000

LoRA

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Language: Python · License: MIT · 000

MQBench

Model Quantization Benchmark

Language: Python · License: Apache-2.0 · 010

pytorch-cifar

95.47% on CIFAR10 with PyTorch

Language: Python · License: MIT · 000

pytorch-cifar-models

Pretrained models on CIFAR10/100 in PyTorch

Language: Python · License: BSD-3-Clause · 000

rocmstat

📊 A simple command-line utility for querying and monitoring GPU status

Language: Python · 000

simplenote-android

Simplenote for Android

Language: Java · License: GPL-2.0 · 000

supermariopy

Python library, scripts, and notebooks that are useful from time to time

Language: Python · License: MIT · 000

triton

Development repository for the Triton language and compiler

License: MIT · 000

tutorials

PyTorch tutorials.

Language: Python · License: BSD-3-Clause · 000

tvm

Open deep learning compiler stack for CPU, GPU, and specialized accelerators

Language: Python · License: Apache-2.0 · 000