FlexInfer

A flexible Python front-end inference SDK.

Features

Flexible

FlexInfer has a Python front-end, which makes it easy to build a computer vision product prototype.

Efficient

Most of the time-consuming parts of FlexInfer are implemented in C++ or CUDA, so FlexInfer is also efficient. If you need maximum efficiency and don't mind working in C++, see CheetahInfer.

This project is released under the Apache 2.0 license.

We have tested the following versions of OS and software:

If your platform is x86 or x64, you can create a conda virtual environment and activate it.

conda create -n flexinfer python=3.6.9 -y
conda activate flexinfer

pip install "git+https://github.com/Media-Smart/flexinfer.git"
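
After the install command above finishes, a quick check from Python can confirm that the package is visible in the active environment. The sketch below uses only the standard library. It assumes the importable module name is `flexinfer` (inferred from the repository name, not confirmed here), and `is_installed` is a hypothetical helper, not part of FlexInfer.

```python
import importlib.util


def is_installed(module_name: str = "flexinfer") -> bool:
    """Return True if a top-level module with this name can be located.

    The default name "flexinfer" is an assumption based on the repository name.
    """
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    status = "found" if is_installed() else "not found"
    print(f"flexinfer: {status}")
```

If the module is reported as not found, check that the `flexinfer` conda environment is the one currently activated.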

We provide some examples for different tasks.

Task	Model	Framework	Version	Input shape	Data type	Throughput (FPS)	Latency (ms)
Classification	ResNet18	PyTorch	1.5.0	(1, 3, 224, 224)	FP16	172	6.01
Classification	ResNet18	TensorRT	7.1.0.16	(1, 3, 224, 224)	FP16	754	1.8
Segmentation	U-Net	PyTorch	1.5.0	(1, 3, 513, 513)	FP16	15	63.27
Segmentation	U-Net	TensorRT	7.1.0.16	(1, 3, 513, 513)	FP16	29	34.03
Object Detection	RetinaNet-R50	PyTorch	1.5.0	(1, 3, 768, 1280)	FP16	8	118.79
Object Detection	RetinaNet-R50	TensorRT	7.1.0.16	(1, 3, 768, 1280)	FP16	15	68.10
Object Detection	TinaFace-R50-FPN-BN	PyTorch	1.5.0	(1, 3, 768, 1280)	FP16	3	273.60
Object Detection	TinaFace-R50-FPN-BN	TensorRT	7.1.0.16	(1, 3, 768, 1280)	FP16	6	159.70
Scene Text Recognition	ResNet-CTC	PyTorch	1.5.0	(1, 1, 32, 100)	FP16	113	10.75
Scene Text Recognition	ResNet-CTC	TensorRT	7.1.0.16	(1, 1, 32, 100)	FP16	308	3.55
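
To compare the two backends at a glance, the figures in the table can be turned into speedup ratios. The sketch below copies the throughput and latency values from the table and computes, per model, how many times faster TensorRT is than PyTorch. The names `BENCHMARKS` and `speedups` are illustrative and not part of FlexInfer.

```python
# model -> {framework: (throughput_fps, latency_ms)}, copied from the table above
BENCHMARKS = {
    "ResNet18": {"PyTorch": (172, 6.01), "TensorRT": (754, 1.8)},
    "U-Net": {"PyTorch": (15, 63.27), "TensorRT": (29, 34.03)},
    "RetinaNet-R50": {"PyTorch": (8, 118.79), "TensorRT": (15, 68.10)},
    "TinaFace-R50-FPN-BN": {"PyTorch": (3, 273.60), "TensorRT": (6, 159.70)},
    "ResNet-CTC": {"PyTorch": (113, 10.75), "TensorRT": (308, 3.55)},
}


def speedups(results):
    """Return {model: (throughput_ratio, latency_ratio)} for TensorRT vs PyTorch.

    Both ratios are oriented so that a value above 1 means TensorRT is faster.
    """
    out = {}
    for model, by_framework in results.items():
        pt_fps, pt_ms = by_framework["PyTorch"]
        trt_fps, trt_ms = by_framework["TensorRT"]
        out[model] = (trt_fps / pt_fps, pt_ms / trt_ms)
    return out


if __name__ == "__main__":
    for model, (fps_ratio, ms_ratio) in speedups(BENCHMARKS).items():
        print(f"{model}: throughput x{fps_ratio:.2f}, latency x{ms_ratio:.2f}")
```

For ResNet18, for example, the throughput ratio is 754 / 172, roughly 4.4x. The throughput and latency ratios differ for each model, so the two columns should be read as separate measurements rather than reciprocals of each other.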

We provide toolboxes for training, testing, and deploying models for different tasks.

This repository is currently maintained by Yuxin Zou (@Yuxin Zou), Jun Sun (@ChaseMonsterAway), Hongxiang Cai (@hxcai), and Yichao Xiong (@mileistone).
