java ocr ocr-recognition tess4j tesseract tesseract-ocr

🔎 Tess4JOcrApp

🛠️ Image-to-text extraction from image and PDF files, using the open-source Tesseract OCR engine through Tess4J, a JNA-based Java wrapper for Tesseract.

A native desktop application built in Java with Tess4J.

📌 Features

Multiple image/PDF file uploads
Text extraction from image/PDF files
Export to text file
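
The three features above can be pictured as a small batch pipeline. The following is only a minimal sketch under stated assumptions, not the application's actual code: the class name `OcrBatchExport` and its methods are hypothetical, and the OCR step is passed in as a plain function so the example runs without Tess4J installed. In a real setup that function would typically wrap Tess4J's `Tesseract.doOCR(File)`, as shown in the comment.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

// Hypothetical helper for illustration; not taken from the Tess4JOcrApp sources.
public class OcrBatchExport {

    // Applies the supplied OCR function to every input file and joins the
    // results, labelling each block with the name of the file it came from.
    public static String extractAll(List<File> inputs, Function<File, String> ocr) {
        StringBuilder out = new StringBuilder();
        for (File input : inputs) {
            out.append("=== ").append(input.getName()).append(" ===\n");
            out.append(ocr.apply(input).trim()).append("\n\n");
        }
        return out.toString();
    }

    // Writes the combined text to a UTF-8 text file.
    public static void exportToTextFile(String text, Path target) throws IOException {
        Files.write(target, text.getBytes(StandardCharsets.UTF_8));
    }

    public static void main(String[] args) throws IOException {
        // Stand-in OCR function so the sketch runs without native libraries.
        // With Tess4J on the classpath, it could instead wrap doOCR:
        //   Tesseract t = new Tesseract();
        //   t.setDatapath("tessdata"); // assumed location of the language data
        //   Function<File, String> ocr = f -> {
        //       try { return t.doOCR(f); }
        //       catch (TesseractException e) { throw new RuntimeException(e); }
        //   };
        Function<File, String> ocr = f -> "(recognised text of " + f.getName() + ")";

        // File names are placeholders; the stand-in never opens them.
        List<File> inputs = Arrays.asList(new File("scan1.png"), new File("report.pdf"));
        String text = extractAll(inputs, ocr);

        Path target = Files.createTempFile("ocr-output", ".txt");
        exportToTextFile(text, target);
        System.out.println("Wrote " + target);
    }
}
```

Passing the OCR step in as a function keeps file handling and export separate from the engine, so the batch and export logic can be exercised without Tesseract's native libraries or language data.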

✍ Read related posts here

Article One :: Link :: Build a Portable OCR Tool in 4 Steps with Tess4J — A Tesseract Wrapper for Java

Article Two :: Link :: Building an OCR Native Application Tool with Tess4J — Extract Text from PDF in just 3 steps

Application GUI as of Sep 2022

🌟 Application GUI as of Feb 2024

Note: The latest version is currently available in the v4x folder.

License

Both Tesseract and this Software are licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

— Join me on 📝 Medium at ~ ξ(🎀˶❛◡❛) @geek-cc

🌮 Please buy me a Taco! 😋

About

An Image-to-Text OCR Extraction Tool built in Java with Tess4J, a JNA wrapper class for Tesseract.

Apache License 2.0

Languages

Java 99.9%, Makefile 0.1%