tignis / pytessy

Faster access to Tesseract-OCR from Python

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

PyTessy - Tesseract-OCR, faster!

This module allows faster access to Tesseract-OCR from Python scripts.

Why and when is it so fast?

PyTessy uses direct library-level access to Tesseract-OCR's core library. Therefore is it so fast in case when the image is already in the memory or when the image need to be processed before scanning with Tesseract-OCR. In case of reading and scanning existing files only PyTessy is just a bit faster than usual Tesseract-OCR Python wrappers.

Requirements

Operating system

PyTessy is operating system independent in case if you set the exact location of your Tesseract-OCR library since presently library search process is implemented on Windows only.

Python modules

PyTessy uses only modules from the Standard Library only. Python version must be >= 3.6.

External requirements

You have to have installed or portable version of Tesseract-OCR (at least a working library and tessdata).

You can download Tesseract-OCR from here.

Installation

You can install the latest PyTessy version with pip install pytessy or you can download the wheel from this repository or you can build it from the source code.

Documentation

PyTessy has a ReadTheDocs page

About

Faster access to Tesseract-OCR from Python

License:Boost Software License 1.0


Languages

Language:Python 100.0%