continuous repetition letters of the OCR

Question

continuous repetition letters of the OCR

wqt2019 opened this issue 6 years ago · comments

wqt2019 commented 6 years ago

hi Belval , when i retrain the crnn code ,i found that if there were continuous repetition letters of the OCR, the result offer missed the repetition letters and outputed the single letter . Such as the Ground truth of the OCR is '0870011' , '37075337' , and the Prediction is '08701' , '3707537' . If there were no continuous repetition letters ,the result was correct . My training data only include digital provide by your project https://github.com/Belval/TextRecognitionDataGenerator .
Is it the problem of the CTC ? How can i to solve this problem?
ths!

Edouard Belval · Answer 1 · Wed Jul 04 2018 18:53:20 GMT+0800 (China Standard Time)

This is a known tradeoff with CTC. Usually researchers will use a lexicon to overcome this but it doesn't apply in your case.

In the meantime, you can use merge_repeated=False as an additional parameter in the ctc_beam_search_decoder op to prevent merging. Do note that this will probably duplicate some letters.

https://www.tensorflow.org/api_docs/python/tf/nn/ctc_beam_search_decoder

wqt2019 · Answer 2 · Wed Jul 04 2018 21:08:53 GMT+0800 (China Standard Time)

ths . solved