Zhiqiang Wang's repositories
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
caffe2onnx
caffe model to onnx
EasyCV
An all-in-one toolkit for computer vision
FastDeploy
⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.
mlprodict
Productionize machine learning predictions, with ONNX or without
mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
mmyolo
OpenMMLab YOLO series toolbox and benchmark
namex
Clean up the public namespace of your package!
nn-Meter
A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.
nni
An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
onnx-simplifier
Simplify your onnx model
onnx2torch
Convert ONNX models to PyTorch.
ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
shumai
Fast ML in JS with Bun + Flashlight
Sparsebit
A model compression and acceleration toolbox based pytorch.
TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and p
torch-tensorrt
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
yolov5-oneflow
A more efficient yolov5 with oneflow backend 🎉🎉🎉