Wei Huang's starred repositories
micronet
micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape
tensorpack
A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility
pytorch_resnet_cifar10
Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.
LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.
DejaVu_predictor
The codes for training sparsity predictor on LLaMA.
accelerate
🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
llm-autoeval
Automatically evaluate your LLMs in Google Colab
human-eval
Code for the paper "Evaluating Large Language Models Trained on Code"
lm-evaluation-harness
A framework for few-shot evaluation of language models.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Modularity-Analysis
Repo for ACL2023 Findings paper "Emergent Modularity in Pre-trained Transformers"
learning_research
本人的科研经验
Awesome-Efficient-LLM
A curated list for Efficient Large Language Models
lm-evaluation-harness
A framework for few-shot evaluation of language models.
hugo-blox-builder
🚨 GROW YOUR AUDIENCE WITH HUGOBLOX! 🚀 HugoBlox is an easy, fast no-code website builder for researchers, entrepreneurs, data scientists, and developers. Build stunning sites in minutes. 适合研究人员、企业家、数据科学家和开发者的简单快速无代码网站构建器。用拖放功能、可定制模板和内置SEO工具快速创建精美网站!
openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version