TimVerion / DNNCode

DNN learning: Classification、Landmark、CTC OCR

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DNN learning

DNN Code including classification,landmark,text recognition...
Training and testing script of each DNN project is included.

Classification:

Classification models are trained and evaluated on the task of character classification including 36 characters(10 digits and 26 uppercase letter).
input: 28 x 28 x 3
environment: openvino 2019 、CORE i7、GTX 1060

Names of Classfier are just the general type of CNN net,readers can go into codes to find specific type. Readers can fine-tuning the network architecture for your own tasks. (The accuarcy and speed of every classifier is only for your reference,
it may floats because of training steps 、learning rate and any other hyper parameter.)

Classifier Accuarcy (%) Speed (ms)
my classification 99.6 1.62
MobileNetV1 99.3 0.63
MobileNetV2 99.7 1.12
ResNetV1 99.8 4.22
ResNetV2 99.2 4.03
ShuffleNetV1 92.7 1.52
ShuffleNetV2 97.7 1.75
ResNeXt(C=1) 99.0 1.02
ResNeXt(C=32) 97.8 12.80
DenseNet 98.4 5.40
DenseNet(My) 99.3 3.40
WideResNet 98.7 4.50
SqueezeNet 98.8 0.67
SENet 97.8 2.8
MobileNetV3 99.2 1.3
EfficientNet 98.8 1.57

landmark:

License plate corner detection

text recognition:

CTC Loss - base

face recognition:

Triplet Loss - base

number between images is the distance of the pair images
threshold can be set to 1.0 to determine if the two images are from the same person.

About

DNN learning: Classification、Landmark、CTC OCR


Languages

Language:Python 100.0%