nbhupendra / athento-imaging

This module includes some useful operations to improve images of documents for a better Undestanding Documents

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Code Climate Build Status Coverage Status

Athento-Imaging

Athento-Imaging is a Python library developed using OpenCV to improve OCR in documents. Among the documents tested using this library are: passports, bills, delivery notes, budgets, and other common documents.

You can check everything out in the Athento-Imaging Summary.

The quality of the output and it's OCR performance will depend on:

  • The quality of the source document, as the quality value increases so does the OCR.
  • The amount of noise in the document and it's position.
  • The presence and position of the document's watermarks.
  • The colour of the document. Clear colours are easier to remove than darker colours due to the proximity of the pixel values between the background and the text.
  • Your personal experience in image transformation, as you might need to perform a combination of operations or change the parameters values significantly.

About

This module includes some useful operations to improve images of documents for a better Undestanding Documents


Languages

Language:Python 100.0%