xlihub / OCR_preprocessing_tool

A simple OCR preprocessing tool using Python with a GUI.

OCR_preprocessing_tool

A simple OCR preprocessing tool using Python with a GUI.

This repo is modified from https://github.com/insaneyilin/document_scanner, and note_shrink.py is modified from https://github.com/mzucker/noteshrink.

Usage

GUI - image rotation, binarization, edge detection, dilation/erosion, automatic/manual doc scanner, color inversion, and pdf to png conversion:
```
python OCR_preprocessing_tool.py
```

Command Line - automatic doc scanner:

python doc_scanner_app.py --image=<input_image_path>

Command Line - text compressing and enhancing:
```
python note_shrink.py IMAGE <input_image_path>
```
Run the code below for more tips:
```
python note_shrink.py -h
```

Dependencies

Python 3
Tkinter
OpenCV
Pillow
NumPy
Scipy
pdf2image

pip install -r requirements.txt

Demo

Rotation

Binarization

Edge detection

Erosion

Dilation

Select corners manually

Auto detection (not very robust)

Text enhancement (after applying perspective transform)

Conversion of pdf to png

References

https://github.com/insaneyilin/document_scanner

https://github.com/mzucker/noteshrink

http://www.pyimagesearch.com/2014/09/01/build-kick-ass-mobile-document-scanner-just-5-minutes/

https://www.geeksforgeeks.org/convert-pdf-to-image-using-python/

https://www.geeksforgeeks.org/how-to-hide-recover-and-delete-tkinter-widgets/

http://vipulsharma20.blogspot.com/2016/01/document-scanner-using-python-opencv.html

https://github.com/lancebeet/imagemicro

About

A simple OCR preprocessing tool using Python with a GUI.

MIT License

Languages

Language:Python 100.0%