OCR-Text
Optical Character Recognition using Google's Tesseract engine
OpenCV documentation
Learn OpenCV from here
Learn more about Tesseract
trans_color function
- Load the example image and convert it to grayscale
- Check the preprocess option and, if requested, apply thresholding to the image
- Load the image as a PIL/Pillow image
- Apply additional processing, such as spellchecking for OCR errors or NLP, to the recognized text
To run this script:
- Open a terminal
- Run the commands in this sequence:
$ python ocr.py -image image.jpg
(To apply Gaussian blur or thresholding as a preprocessing step)
$ python ocr.py -image image.jpg -preprocess blur
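
For reference, a command-line parser that accepts the two commands above might look like the sketch below. It mirrors the flags exactly as they are written in this README (`-image`, `-preprocess`); the script's real parser is not shown here, so the default value "thresh", the list of allowed choices, and the helper name `build_parser` are assumptions for illustration only.

```python
import argparse


def build_parser():
    """Build a parser for the flags used in the commands above."""
    parser = argparse.ArgumentParser(
        description="Run Tesseract OCR on an image"
    )
    # Flag spelling follows the README commands; argparse derives the
    # attribute names "image" and "preprocess" from them.
    parser.add_argument(
        "-image", required=True, help="path to the input image"
    )
    # The default and the choices are assumptions, not taken from ocr.py.
    parser.add_argument(
        "-preprocess",
        default="thresh",
        choices=["thresh", "blur"],
        help="preprocessing to apply before OCR",
    )
    return parser


# Parse an explicit argument list, equivalent to the second command above.
args = build_parser().parse_args(
    ["-image", "image.jpg", "-preprocess", "blur"]
)
print(args.image, args.preprocess)  # prints: image.jpg blur
```

In a real script, `parse_args()` would be called with no arguments so that it reads from the command line.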