foamliu / CRNN

PyTorch re-implementation of CRNN: Convolutional Recurrent Neural Network

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Convolutional Recurrent Neural Network

Introduction

This is a PyTorch re-implementation of CRNN: Convolutional Recurrent Neural Network (paper). The features are summarized blow:

DataSet

We use the synthetic dataset (mjsynth) released by Jaderberg et al. as the training data. The dataset contains 8 millions training images and their corresponding ground truth words. Such images are generated by a synthetic text engine and are highly realistic.

@article{Jaderberg14c,
      title={Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition},
      author={Jaderberg, M. and Simonyan, K. and Vedaldi, A. and Zisserman, A.},
      journal={arXiv preprint arXiv:1406.2227},
      year={2014}
    }

Dependency

  • PyTorch 1.1.0

Usage

Data Pre-processing

Extract training & test images:

$ python extract.py

Train

$ python train.py

If you want to visualize during training, run in your terminal:

$ tensorboard --logdir runs

Demo

Pick 10 random examples from test set of mjsynth:

$ python demo.py
Image Word
image U---n-r-e--a-l--is-t-i---c => Unrealistic
image O---V-E-R--R-U--L--E-----S => OVERRULES
image W---E-S--T-M-IIN-S-T-E---R => WESTMINSTER
image m-------o----r--ee-l-----s => morels
image P--E-R-P-L-E-A-I-T-A--R--S => PERPLEAITARS
image E----------D-------------P => EDP
image G-----e-o-lo--giis-t-----s => Geologists
image r-----e----b---a----t---ee => rebate
image F--O-R-E-D-O--O-M--IIN---G => FOREDOOMING
image u--n--re-p-r-es-e-n-te---d => unrepresented

About

PyTorch re-implementation of CRNN: Convolutional Recurrent Neural Network

License:MIT License


Languages

Language:Python 100.0%