ArishSultan / arkitekt_ocr

To extract images from a PDF, use the following command:

python pdf_utils.py <input-pdf> <pages-count>

Use a smaller pages-count for testing.


To apply OCR to the extracted images, use the following command:

python main.py <input-image>

NOTE: Only images extracted by pdf_utils.py whose names match *_1.png can be used as input to main.py.
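
Because main.py only accepts the *_1.png images, it can help to filter the extracted files before running OCR on them. The following is a minimal sketch and not part of the repository: the helper names select_ocr_inputs and run_ocr_on_directory are hypothetical, and it assumes the extracted images sit in a single directory and that main.py is in the current working directory.

```python
import fnmatch
import subprocess
import sys
from pathlib import Path


def select_ocr_inputs(names):
    """Return the file names matching the *_1.png pattern, sorted."""
    # fnmatchcase keeps the match case-sensitive on every platform.
    return sorted(n for n in names if fnmatch.fnmatchcase(n, "*_1.png"))


def run_ocr_on_directory(image_dir):
    """Run main.py once per matching image in image_dir (assumed layout)."""
    directory = Path(image_dir)
    names = [p.name for p in directory.iterdir() if p.is_file()]
    for name in select_ocr_inputs(names):
        # Equivalent to: python main.py <input-image>
        subprocess.run(
            [sys.executable, "main.py", str(directory / name)],
            check=True,
        )
```

Calling run_ocr_on_directory(".") would then invoke main.py on each matching image in the current directory and stop at the first failure, since check=True raises an error on a non-zero exit code.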
