sequence-sh / TesseractConnector

This repository is a mirror of https://gitlab.com/sequence/connectors/tesseract

Home Page:https://sequence.sh

Repository from Github https://github.comsequence-sh/TesseractConnectorRepository from Github https://github.comsequence-sh/TesseractConnector

Sequence Tesseract OCR Connector

Sequence® is a collection of libraries for automation of cross-application e-discovery and forensic workflows.

This connector contains steps to perform optical character recognition (OCR) on image files. It uses the Tesseract open source library as the OCR engine.

Prerequisites

The following needs to be installed:

Examples

OCR a bitmap image

- <path> = 'MyImage.bmp'
- <imageData> = FileRead <path>
- <imageFormat> = GetImageFormat <path>
- <imageText> = TesseractOCR <imageData> <imageFormat>
- Print <imageText>

Documentation

https://sequence.sh

Download

https://sequence.sh/download

Try SCL and Core

https://sequence.sh/playground

Package Releases

Can be downloaded from the Releases page.

NuGet Packages

Release nuget packages are available from nuget.org.

About

This repository is a mirror of https://gitlab.com/sequence/connectors/tesseract

https://sequence.sh

License:Apache License 2.0


Languages

Language:C# 100.0%