hanrui1sensetime's repositories
PoseTracker-Android-Prototype
PoseTracker Android Demo Prototype.
MMDeployX-prototype
The prototype of MMDeployX
GPTQ-for-PULSE
4 bits quantization of PULSE models using GPTQ
Atom
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
llm-awq
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLM
mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
mmclassification
OpenMMLab Image Classification Toolbox and Benchmark
MMDeployX-APK
APK resources of MMDeploy-X
mmdetection
OpenMMLab Detection Toolbox and Benchmark
mmediting
OpenMMLab Image and Video Restoration, Editing and Generation Toolbox
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
mmpose
OpenMMLab Pose Estimation Toolbox and Benchmark.
ncnn
ncnn is a high-performance neural network inference framework optimized for the mobile platform
OmniQuant
OmniQuant is a simple and powerful quantization technique for LLMs.
PaddleVideo
Awesome video understanding toolkits based on PaddlePaddle. It supports video data annotation tools, lightweight RGB and skeleton based action recognition model, practical applications for video tagging and sport action detection.
QUIK
Repository for the QUIK project, enabling the use of 4bit kernels for generative inference
test_repo
my test repo
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.