ilovin/lstm_ctc_ocr

tensorflow warpctc-tensorflow-binding crnn

old master:
- harder to converge compared to the beta version
- supports both standard CTC and warpCTC
- reads the data all at once
dev:
- the pipeline version of lstm_ctc_ocr; images are resized to the same size
- uses tf.records
beta (current):
- generates data on the fly
- handles multi-width images by padding them to the same width

How to use

./train.sh

Dependency

python 3
tensorflow 1.0.1
captcha
warpCTC tensorflow_binding

Some details

The training data:

Note that parameters can be found in ./lstm.yml (higher priority) and lib/lstm/utils/config.y
Some parameters need to be fine-tuned:

learning rate
decay step & decay rate
image_height
optimizer?
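
To show how decay step and decay rate interact with the learning rate, here is a minimal sketch of an exponential decay schedule. It assumes the schedule is exponential, which this README does not state; the function name and the numbers in the usage note are hypothetical and are not taken from lstm.yml.

```python
def decayed_learning_rate(base_lr, global_step, decay_steps, decay_rate, staircase=False):
    """Exponential decay: base_lr * decay_rate ** (global_step / decay_steps).

    Assumption: the schedule is exponential. With staircase=True the exponent
    is truncated to an integer, so the rate drops in discrete steps.
    """
    if decay_steps <= 0:
        raise ValueError("decay_steps must be positive")
    if staircase:
        exponent = global_step // decay_steps
    else:
        exponent = global_step / decay_steps
    return base_lr * decay_rate ** exponent
```

For example, with a hypothetical base rate of 1e-3, decay step 1000 and decay rate 0.9, the rate is multiplied by 0.9 once every 1000 steps. A smaller decay step or a smaller decay rate makes the learning rate shrink faster, which is worth keeping in mind when convergence stalls.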

In ./lib/lstm/utils/gen.py, all images have the same height, and I pad the widths to the same value within each batch. So if you want to use your own data, all of your images must have the same height.
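
The per-batch width padding described above could look roughly like the following. This is only a sketch, not the code from gen.py: it assumes grayscale images stored as 2-D NumPy arrays of shape (height, width), and the function name and pad_value parameter are hypothetical.

```python
import numpy as np


def pad_batch_to_max_width(images, pad_value=0):
    """Right-pad same-height images to the widest image in the batch.

    Assumption: each image is a 2-D array (height, width).
    Returns the padded batch and the original widths, which are typically
    needed later to derive per-sample sequence lengths for CTC.
    """
    heights = {img.shape[0] for img in images}
    if len(heights) != 1:
        raise ValueError("all images in a batch must have the same height")
    height = heights.pop()
    max_width = max(img.shape[1] for img in images)

    # Start from a batch filled with the padding value, then copy each image in.
    batch = np.full((len(images), height, max_width), pad_value, dtype=images[0].dtype)
    widths = np.zeros(len(images), dtype=np.int32)
    for i, img in enumerate(images):
        w = img.shape[1]
        batch[i, :, :w] = img
        widths[i] = w
    return batch, widths
```

Keeping the original widths alongside the padded batch lets the model ignore the padded columns instead of treating them as real input.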

Result

The accuracy can exceed 95%.

Read this blog for more details and this blog for how to use tf.nn.ctc_loss or warpCTC

About

Use CTC + tensorflow to OCR

https://ilovin.github.io/2017-04-06/tensorflow-lstm-ctc-ocr/
