ModelTC's repositories
United-Perception
United Perception
llmc
This is the official PyTorch implementation of "LLM-QBench: A Benchmark Towards the Best Practice for Post-training Quantization of Large Language Models", and also an efficient LLM compression tool with various advanced compression methods, supporting multiple inference backends.
awesome-lm-system
Summary of system papers, frameworks, code, and tools for training or serving large models
Outlier_Suppression_Plus
Official implementation of the EMNLP 2023 paper "Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling"
AAAI2023_EAMPD
AAAI2023 Efficient and Accurate Models towards Practical Deep Learning Baseline
Imagenet-S
Robustness for real-world system noise
ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
general-sam
A general suffix automaton implementation in Rust with Python bindings
mtc-token-healing
Token healing implementation in Rust
general-sam-py
Python bindings for general-sam and some utilities
greedy-tokenizer
Greedily tokenize strings with the longest tokens iteratively.
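
To illustrate the idea behind the greedy-tokenizer description, here is a minimal sketch of longest-match tokenization. It assumes a simple left-to-right scan over a plain set of token strings; the function name `greedy_tokenize` and the single-character fallback are hypothetical choices made for this example, and the repository's actual algorithm and API may differ.

```python
def greedy_tokenize(text, vocab):
    """Left-to-right longest-match tokenization (hypothetical helper)."""
    # Longest token length bounds how far ahead we need to look.
    max_len = max((len(token) for token in vocab), default=0)
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until a vocab entry matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if piece in vocab:
                tokens.append(piece)
                i += size
                break
        else:
            # Assumption: with no match, emit a single character and move on.
            tokens.append(text[i])
            i += 1
    return tokens


vocab = {"un", "happi", "happiness", "ness", "h"}
print(greedy_tokenize("unhappiness", vocab))
```

With this vocabulary the scan picks "happiness" over the shorter "happi" and "h", because longer candidates are tried first at each position.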