Allen.Dou's repositories
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python · Apache-2.0 · 0 · 0 · 0
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference. Documentation:
MIT · 0 · 0 · 0
AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
MIT · 0 · 0 · 0
Apache-2.0 · 0 · 0 · 0
update-kube-cert
Handles expired K8s cluster certificates by extending the validity of kubeadm-generated certificates to 10 years. On versions 1.15.x and later, certificates can be renewed directly with kubeadm alpha certs renew <cert_name>.
MIT · 0 · 0 · 0
chnroutes
Scripts that help Chinese netizens who use a VPN to combat censorship, by modifying the routing table so that only censored IPs are routed through the VPN.
Language: Python · 0 · 0 · 0
git-diffall
Script to perform directory diffs using an external diff tool in Git
Language: Shell · 0 · 0 · 0