xlihub / OCR_preprocessing_tool

A simple OCR preprocessing tool using Python with a GUI.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

OCR_preprocessing_tool

A simple OCR preprocessing tool using Python with a GUI.

This repo is modified from https://github.com/insaneyilin/document_scanner, and note_shrink.py is modified from https://github.com/mzucker/noteshrink.


Usage

  • GUI - image rotation, binarization, edge detection, dilation/erosion, automatic/manual doc scanner, color inversion, and pdf to png conversion:

    python OCR_preprocessing_tool.py
    
  • Command Line - automatic doc scanner:

    python doc_scanner_app.py --image=<input_image_path>
    
  • Command Line - text compressing and enhancing:

    python note_shrink.py IMAGE <input_image_path>
    

    Run the code below for more tips:

    python note_shrink.py -h
    

Dependencies

  • Python 3
  • Tkinter
  • OpenCV
  • Pillow
  • NumPy
  • Scipy
  • pdf2image
pip install -r requirements.txt

Demo

Rotation

Binarization

Edge detection

Erosion

Dilation

Select corners manually

Auto detection (not very robust)

Text enhancement (after applying perspective transform)

Conversion of pdf to png


References

https://github.com/insaneyilin/document_scanner

https://github.com/mzucker/noteshrink

http://www.pyimagesearch.com/2014/09/01/build-kick-ass-mobile-document-scanner-just-5-minutes/

https://www.geeksforgeeks.org/convert-pdf-to-image-using-python/

https://www.geeksforgeeks.org/how-to-hide-recover-and-delete-tkinter-widgets/

http://vipulsharma20.blogspot.com/2016/01/document-scanner-using-python-opencv.html

https://github.com/lancebeet/imagemicro

About

A simple OCR preprocessing tool using Python with a GUI.

License:MIT License


Languages

Language:Python 100.0%