Ther's repositories
Awesome-Transformer-Accleration
Paper list for acceleration of transformers
Ther-nullptr.github.io
personal blogs
smoothquant
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
A-ViT
Official PyTorch implementation of A-ViT: Adaptive Tokens for Efficient Vision Transformer (CVPR 2022)
Analytical-Model-for-GPT-Mapping
An analytical model for GPT mapping strategy, graduation project
bitsandbytes
8-bit CUDA functions for PyTorch
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
ColossalAI
Making big AI models cheaper, easier, and scalable
DSP-Final-Lab
Final Lab of Digital Signal Processing (2022 fall)
FasterTransformer
Transformer-related optimization, including BERT, GPT
FlexGen
Running large language models like OPT-175B/GPT-3 on a single GPU. Focusing on high-throughput large-batch generation.
FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
gem5-stable
Lab Platform of Modern Computer Architecture
ImageRepository
Repository for images
miniWeather
A parallel programming training mini app simulating weather-like flows
MT4SSL
MT4SSL: Boosting Self-Supervised Speech Representation Learning by Integrating Multiple Targets
Plot_Brillouin_Zone
Lab for Foundation of Solid State Physics
Q-ViT-DeiT
DeiT implementation for Q-ViT
qlora
QLoRA: Efficient Finetuning of Quantized LLMs
RPTQ4LLM
Reorder-based post-training quantization for large language model
Sparsebit
A model compression and acceleration toolbox based on PyTorch.
STD_LAB
Lab for Introduction to Auditory-visual Information System
THUAI6
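The 6th Tsinghua University Artificial Intelligence Challenge, Department of Electronic Engineering track (formerly the Department of Electronic Engineering's 24th Team Programming Contest, teamstyle24)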
Torch-Pruning
[CVPR-2023] Towards Any Structural Pruning; LLaMA / YOLOv8 / CNNs / Transformers
torchinfo
View model summaries in PyTorch!