Awesome Computer Vision Models

A curated list of popular classification, segmentation and detection models with corresponding evaluation metrics from papers.

Classification models
Segmentation models
Detection models

Classification models

Model	Number of parameters	FLOPS	Top-1 Error	Top-5 Error	Year
AlexNet ('One weird trick for parallelizing convolutional neural networks')	62.3M	1,132.33M	40.96	18.24	2014
VGG-16 ('Very Deep Convolutional Networks for Large-Scale Image Recognition')	138.3M	?	26.78	8.69	2014
ResNet-10 ('Deep Residual Learning for Image Recognition')	5.5M	894.04M	34.69	14.36	2015
ResNet-18 ('Deep Residual Learning for Image Recognition')	11.7M	1,820.41M	28.53	9.82	2015
ResNet-34 ('Deep Residual Learning for Image Recognition')	21.8M	3,672.68M	24.84	7.80	2015
ResNet-50 ('Deep Residual Learning for Image Recognition')	25.5M	3,877.95M	22.28	6.33	2015
InceptionV3 ('Rethinking the Inception Architecture for Computer Vision')	23.8M	?	21.2	5.6	2015
PreResNet-18 ('Identity Mappings in Deep Residual Networks')	11.7M	1,820.56M	28.43	9.72	2016
PreResNet-34 ('Identity Mappings in Deep Residual Networks')	21.8M	3,672.83M	24.89	7.74	2016
PreResNet-50 ('Identity Mappings in Deep Residual Networks')	25.6M	3,875.44M	22.40	6.47	2016
DenseNet-121 ('Densely Connected Convolutional Networks')	8.0M	2,872.13M	23.48	7.04	2016
DenseNet-161 ('Densely Connected Convolutional Networks')	28.7M	7,793.16M	22.86	6.44	2016
PyramidNet-101 ('Deep Pyramidal Residual Networks')	42.5M	8,743.54M	21.98	6.20	2016
ResNeXt-14(32x4d) ('Aggregated Residual Transformations for Deep Neural Networks')	9.5M	1,603.46M	30.32	11.46	2016
ResNeXt-26(32x4d) ('Aggregated Residual Transformations for Deep Neural Networks')	15.4M	2,488.07M	24.14	7.46	2016
WRN-50-2 ('Wide Residual Networks')	68.9M	11,405.42M	22.53	6.41	2016
Xception ('Xception: Deep Learning with Depthwise Separable Convolutions')	22,855,952	8,403.63M	20.97	5.49	2016
InceptionV4 ('Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning')	42,679,816	12,304.93M	20.64	5.29	2016
InceptionResNetV2 ('Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning')	55,843,464	13,188.64M	19.93	4.90	2016
PolyNet ('PolyNet: A Pursuit of Structural Diversity in Very Deep Networks')	95,366,600	34,821.34M	19.10	4.52	2016
DarkNet Ref ('Darknet: Open source neural networks in C')	7,319,416	367.59M	38.58	17.18	2016
DarkNet Tiny ('Darknet: Open source neural networks in C')	1,042,104	500.85M	40.74	17.84	2016
DarkNet 53 ('Darknet: Open source neural networks in C')	41,609,928	7,133.86M	21.75	5.64	2016
SqueezeResNet1.1 ('SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size')	1,235,496	352.02M	40.09	18.21	2016
SqueezeNet1.1 ('SqueezeNet: AlexNet-level accuracy with 50x fewer parameters and <0.5MB model size')	1,235,496	352.02M	39.31	17.72	2016
ResAttNet-92 ('Residual Attention Network for Image Classification')	51.3M	?	19.5	4.8	2017
CondenseNet (G=C=8) ('CondenseNet: An Efficient DenseNet using Learned Group Convolutions')	4.8M	?	26.2	8.3	2017
DPN-68 ('Dual Path Networks')	12,611,602	2,351.84M	23.24	6.79	2017
ShuffleNet x1.0 (g=1) ('ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices')	1,531,936	148.13M	34.93	13.89	2017
DiracNetV2-18 ('DiracNets: Training Very Deep Neural Networks Without Skip-Connections')	11,511,784	1,796.62M	31.47	11.70	2017
DiracNetV2-34 ('DiracNets: Training Very Deep Neural Networks Without Skip-Connections')	21,616,232	3,646.93M	28.75	9.93	2017
SENet-16 ('Squeeze-and-Excitation Networks')	31,366,168	5,081.30M	25.65	8.20	2017
SENet-154 ('Squeeze-and-Excitation Networks')	115,088,984	20,745.78M	18.62	4.61	2017
MobileNet ('MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications')	4,231,976	579.80M	26.61	8.95	2017
NASNet-A 4@1056 ('Learning Transferable Architectures for Scalable Image Recognition')	5,289,978	584.90M	25.68	8.16	2017
NASNet-A 6@4032('Learning Transferable Architectures for Scalable Image Recognition')	88,753,150	23,976.44M	18.14	4.21	2017
DLA-34 ('Deep Layer Aggregation')	15,742,104	3,071.37M	25.36	7.94	2017
AirNet50-1x64d (r=2) ('Attention Inspiring Receptive-Fields Network for Learning Invariant Representations')	27.43M	?	22.48	6.21	2018
BAM-ResNet-50 ('BAM: Bottleneck Attention Module')	25.92M	?	23.68	6.96	2018
CBAM-ResNet-50 ('CBAM: Convolutional Block Attention Module')	28.1M	?	23.02	6.38	2018
1.0-SqNxt-23v5 ('SqueezeNext: Hardware-Aware Neural Network Design')	921,816	285.82M	40.77	17.85	2018
1.5-SqNxt-23v5 ('SqueezeNext: Hardware-Aware Neural Network Design')	1,953,616	550.97M	33.81	13.01	2018
2.0-SqNxt-23v5 ('SqueezeNext: Hardware-Aware Neural Network Design')	3,366,344	897.60M	29.63	10.66	2018
ShuffleNetV2 ('ShuffleNet V2: Practical Guidelines for Efficient CNN Architecture Design')	2,278,604	149.72M	31.44	11.63	2018
456-MENet-24×1(g=3) ('Merging and Evolution: Improving Convolutional Neural Networks for Mobile Applications')	5.3M	?	28.4	9.8	2018
FD-MobileNet ('FD-MobileNet: Improved MobileNet with A Fast Downsampling Strategy')	2,901,288	147.46M	34.23	13.38	2018
MobileNetV2 ('MobileNetV2: Inverted Residuals and Linear Bottlenecks')	3,504,960	329.36M	26.97	8.87	2018
IGCV3 ('IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks')	3.5M	?	28.22	9.54	2018
DARTS ('DARTS: Differentiable Architecture Search')	4.9M	?	26.9	9.0	2018
PNASNet-5 ('Progressive Neural Architecture Search')	5.1M	?	25.8	8.1	2018
AmoebaNet-C ('Regularized Evolution for Image Classifier Architecture Search')	5.1M	?	24.3	7.6	2018
MnasNet ('MnasNet: Platform-Aware Neural Architecture Search for Mobile')	4,308,816	317.67M	31.58	11.74	2018
IBN-Net50-a ('Two at Once: Enhancing Learning andGeneralization Capacities via IBN-Net')	?	?	22.54	6.32	2018
MarginNet ('Large Margin Deep Networks for Classification')	?	?	22.0	?	2018
A^2 Net ('A^2-Nets: Double Attention Networks')	?	?	23.0	6.5	2018
FishNeXt-150 ('FishNet: A Versatile Backbone for Image, Region, and Pixel Level Prediction')	26.2M	?	21.5	?	2018
Shape-ResNet ('IMAGENET-TRAINED CNNS ARE BIASED TOWARDS TEXTURE; INCREASING SHAPE BIAS IMPROVES ACCURACY AND ROBUSTNESS')	25.5M	?	23.28	6.72	2019
SimCNN(k=3 train) ('Greedy Layerwise Learning Can Scale to ImageNet')	?	?	28.4	10.2	2019
SKNet-50 ('Selective Kernel Networks')	27.5M	?	20.79	?	2019
SRM-ResNet-50 ('SRM : A Style-based Recalibration Module for Convolutional Neural Networks')	25.62M	?	22.87	6.49	2019
EfficientNet-B0 ('EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks')	5,288,548	414.31M	24.77	7.52	2019
EfficientNet-B7b ('EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks')	66,347,960	39,010.98M	15.94	3.22	2019
ProxylessNAS ('PROXYLESSNAS: DIRECT NEURAL ARCHITECTURE SEARCH ON TARGET TASK AND HARDWARE')	?	?	24.9	7.5	2019
MixNet-L ('MixNet: Mixed Depthwise Convolutional Kernels')	7.3M	?	21.1	5.8	2019
ECA-Net50 ('ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks')	24.37M	3.86G	22.52	6.32	2019
ECA-Net101 ('ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks')	7.3M	7.35G	21.35	5.66	2019
ACNet-Densenet121 ('ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks')	?	?	24.18	7.23	2019
LIP-ResNet-50 ('LIP: Local Importance-based Pooling')	23.9M	5.33G	21.81	6.04	2019
LIP-ResNet-101 ('LIP: Local Importance-based Pooling')	42.9M	9.06G	20.67	5.40	2019
LIP-DenseNet-BC-121 ('LIP: Local Importance-based Pooling')	8.7M	4.13G	23.36	6.84	2019
MuffNet_1.0 ('MuffNet: Multi-Layer Feature Federation for Mobile Deep Learning')	2.3M	146M	30.1	?	2019
MuffNet_1.5 ('MuffNet: Multi-Layer Feature Federation for Mobile Deep Learning')	3.4M	300M	26.9	?	2019
ResNet-34-Bin-5 ('Making Convolutional Networks Shift-Invariant Again')	21.8M	3,672.68M	25.80	?	2019
ResNet-50-Bin-5 ('Making Convolutional Networks Shift-Invariant Again')	25.5M	3,877.95M	22.96	?	2019
MobileNetV2-Bin-5 ('Making Convolutional Networks Shift-Invariant Again')	3,504,960	329.36M	27.50	?	2019
FixRes ResNeXt101 WSL ('Fixing the train-test resolution discrepancy')	829M	?	13.6	2.0	2019
Noisy Student*(L2) ('Self-training with Noisy Student improves ImageNet classification')	480M	?	12.6	1.8	2019
TResNet-M ('TResNet: High Performance GPU-Dedicated Architecture')	29.4M	5.5G	19.3	?	2020
DA-NAS-C ('DA-NAS: Data Adapted Pruning for Efficient Neural Architecture Search')	?	467M	23.8	?	2020
ResNeSt-50 ('ResNeSt: Split-Attention Networks')	27.5M	5.39G	18.87	?	2020
ResNeSt-101 ('ResNeSt: Split-Attention Networks')	48.3M	10.2G	17.73	?	2020
ResNet-50-FReLU ('Funnel Activation for Visual Recognition')	25.5M	3.87G	22.40	?	2020
ResNet-101-FReLU ('Funnel Activation for Visual Recognition')	44.5M	7.6G	22.10	?	2020
ResNet-50-MEALv2 ('MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks')	25.6M	?	19.33	4.91	2020
ResNet-50-MEALv2 + CutMix ('MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks')	25.6M	?	19.02	4.65	2020
MobileNet V3-Large-MEALv2 ('MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks')	5.48M	?	23.08	6.68	2020
EfficientNet-B0-MEALv2 ('MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks')	5.29M	?	21.71	6.05	2020
T2T-ViT-7 ('Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet')	4.2M	0.6G	28.8	?	2021
T2T-ViT-14 ('Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet')	19.4M	4.8G	19.4	?	2021
T2T-ViT-19 ('Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet')	39.0M	8.0G	18.8	?	2021
NFNet-F0 ('High-Performance Large-Scale Image Recognition Without Normalization')	71.5M	12.38G	16.4	3.2	2021
NFNet-F1 ('High-Performance Large-Scale Image Recognition Without Normalization')	132.6M	35.54G	15.4	2.9	2021
NFNet-F6+SAM ('High-Performance Large-Scale Image Recognition Without Normalization')	438.4M	377.28G	13.5	2.1	2021
EfficientNetV2-S ('EfficientNetV2: Smaller Models and Faster Training')	24M	8.8G	16.1	?	2021
EfficientNetV2-M ('EfficientNetV2: Smaller Models and Faster Training')	55M	24G	14.9	?	2021
EfficientNetV2-L ('EfficientNetV2: Smaller Models and Faster Training')	121M	53G	14.3	?	2021
EfficientNetV2-S (21k) ('EfficientNetV2: Smaller Models and Faster Training')	24M	8.8G	15.0	?	2021
EfficientNetV2-M (21k) ('EfficientNetV2: Smaller Models and Faster Training')	55M	24G	13.9	?	2021
EfficientNetV2-L (21k) ('EfficientNetV2: Smaller Models and Faster Training')	121M	53G	13.2	?	2021

Segmentation models

Model	Year	PASCAL-Context	Cityscapes (mIOU)	PASCAL VOC 2012 (mIOU)	COCO Stuff	ADE20K VAL (mIOU)
U-Net ('U-Net: Convolutional Networks for Biomedical Image Segmentation')	2015	?	?	?	?	?
DeconvNet ('Learning Deconvolution Network for Semantic Segmentation')	2015	?	?	72.5	?	?
ParseNet ('ParseNet: Looking Wider to See Better')	2015	40.4	?	69.8	?	?
Piecewise ('Efficient piecewise training of deep structured models for semantic segmentation')	2015	43.3	71.6	78.0	?	?
SegNet ('SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation')	2016	?	56.1	?	?	?
FCN ('Fully Convolutional Networks for Semantic Segmentation')	2016	37.8	65.3	62.2	22.7	29.39
ENet ('ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation')	2016	?	58.3	?	?	?
DilatedNet ('MULTI-SCALE CONTEXT AGGREGATION BY DILATED CONVOLUTIONS')	2016	?	?	67.6	?	32.31
PixelNet ('PixelNet: Towards a General Pixel-Level Architecture')	2016	?	?	69.8	?	?
RefineNet ('RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation')	2016	47.3	73.6	83.4	33.6	40.70
LRR ('Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation')	2016	?	71.8	79.3	?	?
FRRN ('Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes')	2016	?	71.8	?	?	?
MultiNet ('MultiNet: Real-time Joint Semantic Reasoning for Autonomous Driving')	2016	?	?	?	?	?
DeepLab ('DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs')	2017	45.7	64.8	79.7	?	?
LinkNet ('LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation')	2017	?	?	?	?	?
Tiramisu ('The One Hundred Layers Tiramisu: Fully Convolutional DenseNets for Semantic Segmentation')	2017	?	?	?	?	?
ICNet ('ICNet for Real-Time Semantic Segmentation on High-Resolution Images')	2017	?	70.6	?	?	?
ERFNet ('Efficient ConvNet for Real-time Semantic Segmentation')	2017	?	68.0	?	?	?
PSPNet ('Pyramid Scene Parsing Network')	2017	47.8	80.2	85.4	?	44.94
GCN ('Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network')	2017	?	76.9	82.2	?	?
Segaware ('Segmentation-Aware Convolutional Networks Using Local Attention Masks')	2017	?	?	69.0	?	?
PixelDCN ('PIXEL DECONVOLUTIONAL NETWORKS')	2017	?	?	73.0	?	?
DeepLabv3 ('Rethinking Atrous Convolution for Semantic Image Segmentation')	2017	?	?	85.7	?	?
DUC, HDC ('Understanding Convolution for Semantic Segmentation')	2018	?	77.1	?	?	?
ShuffleSeg ('SHUFFLESEG: REAL-TIME SEMANTIC SEGMENTATION NETWORK')	2018	?	59.3	?	?	?
AdaptSegNet ('Learning to Adapt Structured Output Space for Semantic Segmentation')	2018	?	46.7	?	?	?
TuSimple-DUC ('Understanding Convolution for Semantic Segmentation')	2018	80.1	?	83.1	?	?
R2U-Net ('Recurrent Residual Convolutional Neural Network based on U-Net (R2U-Net) for Medical Image Segmentation')	2018	?	?	?	?	?
Attention U-Net ('Attention U-Net: Learning Where to Look for the Pancreas')	2018	?	?	?	?	?
DANet ('Dual Attention Network for Scene Segmentation')	2018	52.6	81.5	?	39.7	?
ENCNet ('Context Encoding for Semantic Segmentation')	2018	51.7	75.8	85.9	?	44.65
ShelfNet ('ShelfNet for Real-time Semantic Segmentation')	2018	48.4	75.8	84.2	?	?
LadderNet ('LADDERNET: MULTI-PATH NETWORKS BASED ON U-NET FOR MEDICAL IMAGE SEGMENTATION')	2018	?	?	?	?	?
CCC-ERFnet ('Concentrated-Comprehensive Convolutions for lightweight semantic segmentation')	2018	?	69.01	?	?	?
DifNet-101 ('DifNet: Semantic Segmentation by Diffusion Networks')	2018	45.1	?	73.2	?	?
BiSeNet(Res18) ('BiSeNet: Bilateral Segmentation Network for Real-time Semantic Segmentation')	2018	?	?	74.7	28.1	?
ESPNet ('ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation')	2018	?	?	63.01	?	?
SPADE ('Semantic Image Synthesis with Spatially-Adaptive Normalization')	2019	?	62.3	?	37.4	38.5
SeamlessSeg ('Seamless Scene Segmentation')	2019	?	77.5	?	?	?
EMANet ('Expectation-Maximization Attention Networks for Semantic Segmentation')	2019	?	?	88.2	39.9	?

Detection models

Model	Year	VOC07 (mAP@IoU=0.5)	VOC12 (mAP@IoU=0.5)	COCO (mAP)
R-CNN ('Rich feature hierarchies for accurate object detection and semantic segmentation')	2014	58.5	?	?
OverFeat ('OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks')	2014	?	?	?
MultiBox ('Scalable Object Detection using Deep Neural Networks')	2014	29.0	?	?
SPP-Net ('Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition')	2014	59.2	?	?
MR-CNN ('Object detection via a multi-region & semantic segmentation-aware CNN model')	2015	78.2	73.9	?
AttentionNet ('AttentionNet: Aggregating Weak Directions for Accurate Object Detection')	2015	?	?	?
Fast R-CNN ('Fast R-CNN')	2015	70.0	68.4	?
Fast R-CNN ('Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks')	2015	73.2	70.4	36.8
YOLO v1 ('You Only Look Once: Unified, Real-Time Object Detection')	2016	66.4	57.9	?
G-CNN ('G-CNN: an Iterative Grid Based Object Detector')	2016	66.8	66.4	?
AZNet ('Adaptive Object Detection Using Adjacency and Zoom Prediction')	2016	70.4	?	22.3
ION ('Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks')	2016	80.1	77.9	33.1
HyperNet ('HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection')	2016	76.3	71.4	?
OHEM ('Training Region-based Object Detectors with Online Hard Example Mining')	2016	78.9	76.3	22.4
MPN ('A MultiPath Network for Object Detection')	2016	?	?	33.2
SSD ('SSD: Single Shot MultiBox Detector')	2016	76.8	74.9	31.2
GBDNet ('Crafting GBD-Net for Object Detection')	2016	77.2	?	27.0
CPF ('Contextual Priming and Feedback for Faster R-CNN')	2016	76.4	72.6	?
MS-CNN ('A Unified Multi-scale Deep Convolutional Neural Network for Fast Object Detection')	2016	?	?	?
R-FCN ('R-FCN: Object Detection via Region-based Fully Convolutional Networks')	2016	79.5	77.6	29.9
PVANET ('PVANET: Deep but Lightweight Neural Networks for Real-time Object Detection')	2016	?	?	?
DeepID-Net ('DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection')	2016	69.0	?	?
NoC ('Object Detection Networks on Convolutional Feature Maps')	2016	71.6	68.8	27.2
DSSD ('DSSD : Deconvolutional Single Shot Detector')	2017	81.5	80.0	?
TDM ('Beyond Skip Connections: Top-Down Modulation for Object Detection')	2017	?	?	37.3
FPN ('Feature Pyramid Networks for Object Detection')	2017	?	?	36.2
YOLO v2 ('YOLO9000: Better, Faster, Stronger')	2017	78.6	73.4	21.6
RON ('RON: Reverse Connection with Objectness Prior Networks for Object Detection')	2017	77.6	75.4	?
DCN ('Deformable Convolutional Networks')	2017	?	?	?
DeNet ('DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling')	2017	77.1	73.9	33.8
CoupleNet ('CoupleNet: Coupling Global Structure with Local Parts for Object Detection')	2017	82.7	80.4	34.4
RetinaNet ('Focal Loss for Dense Object Detection')	2017	?	?	39.1
Mask R-CNN ('Mask R-CNN')	2017	?	?	39.8
DSOD ('DSOD: Learning Deeply Supervised Object Detectors from Scratch')	2017	77.7	76.3	?
SMN ('Spatial Memory for Context Reasoning in Object Detection')	2017	70.0	?	?
YOLO v3 ('YOLOv3: An Incremental Improvement')	2018	?	?	33.0
SIN ('Structure Inference Net: Object Detection Using Scene-Level Context and Instance-Level Relationships')	2018	76.0	73.1	23.2
STDN ('Scale-Transferrable Object Detection')	2018	80.9	?	?
RefineDet ('Single-Shot Refinement Neural Network for Object Detection')	2018	83.8	83.5	41.8
MegDet ('MegDet: A Large Mini-Batch Object Detector')	2018	?	?	?
RFBNet ('Receptive Field Block Net for Accurate and Fast Object Detection')	2018	82.2	?	?
CornerNet ('CornerNet: Detecting Objects as Paired Keypoints')	2018	?	?	42.1
LibraRetinaNet ('Libra R-CNN: Towards Balanced Learning for Object Detection')	2019	?	?	43.0
YOLACT-700 ('YOLACT Real-time Instance Segmentation')	2019	?	?	31.2
DetNASNet(3.8) ('DetNAS: Backbone Search for Object Detection')	2019	?	?	42.0
YOLOv4 ('YOLOv4: Optimal Speed and Accuracy of Object Detection')	2020	?	?	46.7
SOLO ('SOLO: Segmenting Objects by Locations')	2020	?	?	37.8
D-SOLO ('SOLO: Segmenting Objects by Locations')	2020	?	?	40.5
SNIPER ('Scale Normalized Image Pyramids with AutoFocus for Object Detection')	2021	86.6	?	47.9
AutoFocus ('Scale Normalized Image Pyramids with AutoFocus for Object Detection')	2021	85.8	?	47.9

gmalivenko / awesome-computer-vision-models

Awesome Computer Vision Models

Contents

Classification models

Segmentation models

Detection models

About