huangwei021230

Wei Huang's starred repositories

nni

An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

Language:PythonMIT1405400

Moonlit

This is a collection of our research on efficient AI, covering hardware-aware NAS and model compression.

Language:PythonMIT7300

MQBench

Model Quantization Benchmark

Language:ShellApache-2.076400

micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference)、Low-Bit(≤2b)/Ternary and Binary(TWN/BNN/XNOR-Net); post-training-quantization(PTQ), 8-bit(tensorrt); 2、 pruning: normal、regular and group convolutional channel pruning; 3、 group convolution structure; 4、batch-normalization fuse for quantization. deploy: tensorrt, fp32/fp16/int8(ptq-calibration)、op-adapt(upsample)、dynamic_shape

Language:PythonMIT221800

tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Language:PythonApache-2.0630300

pytorch_resnet_cifar10

Proper implementation of ResNet-s for CIFAR10/100 in pytorch that matches description of the original paper.

Language:PythonBSD-2-Clause122400

lsq-net

Unofficial implementation of LSQ-Net, a neural network quantization framework

Language:PythonMIT27700

LLM-Pruner

[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baichuan, TinyLlama, etc.

Language:PythonApache-2.087300

DejaVu_predictor

The codes for training sparsity predictor on LLaMA.

Language:Python1500

MambaOut

MambaOut: Do We Really Need Mamba for Vision?

Language:PythonApache-2.0203200

accelerate

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Language:PythonApache-2.0794800

llm-autoeval

Automatically evaluate your LLMs in Google Colab

Language:PythonMIT55800

FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

Language:C++Apache-2.0171000

human-eval

Code for the paper "Evaluating Large Language Models Trained on Code"

Language:PythonMIT241400

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT100

lm-bench

Language:Python100

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.03030700

wanda

A simple and effective LLM pruning approach.

Language:PythonMIT66900

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT100

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.0100

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language:PythonMIT121300

Modularity-Analysis

Repo for ACL2023 Findings paper "Emergent Modularity in Pre-trained Transformers"

Language:PythonMIT2000

learning_research

本人的科研经验

592700

Awesome-Efficient-LLM

A curated list for Efficient Large Language Models

Language:Python126500

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language:PythonMIT698100

Decentralized_FM_alpha

Language:Python1900

hugo-blox-builder

🚨 GROW YOUR AUDIENCE WITH HUGOBLOX! 🚀 HugoBlox is an easy, fast no-code website builder for researchers, entrepreneurs, data scientists, and developers. Build stunning sites in minutes. 适合研究人员、企业家、数据科学家和开发者的简单快速无代码网站构建器。用拖放功能、可定制模板和内置SEO工具快速创建精美网站！

Language:HTMLMIT837700

openmlsys-zh

《Machine Learning Systems: Design and Implementation》- Chinese Version

Language:TeX407700

DejaVu

Language:Python28700

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonMIT3737700