incubated-geek-cc / Tess4JOcrApp

An Image-to-Text OCR Extraction Tool built in Java with Tess4J, a JNA wrapper class for Tesseract.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

logo

๐Ÿ”Ž Tess4JOcrApp

๐Ÿ› ๏ธ Image-to-Text extraction from image and PDF files. Uses the open-sourced Tesseract OCR Engine & JNA Java Wrapper Class.

A native Desktop application built with Tess4J in Java.

๐Ÿ“Œ Features

  • Multiple image/PDF file uploads
  • Text extraction from image/PDF files
  • Export to text file

โœ Read related posts here

Article One :: Link :: Build a Portable OCR Tool in 4 Steps with Tess4J โ€” A Tesseract Wrapper for Java


Article Two :: Link :: Building an OCR Native Application Tool with Tess4J โ€” Extract Text from PDF in just 3 steps


Application GUI as of Sep 2022

๐ŸŒŸ Application GUI as of Feb 2024

Note: As of current, the latest version is available in the folder at v4x.


License

Both Tesseract and this Software are licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

โ€” Join me on ๐Ÿ“ Medium at ~ ฮพ(๐ŸŽ€หถโ›โ—กโ›) @geek-cc


๐ŸŒฎ Please buy me a Taco! ๐Ÿ˜‹

About

An Image-to-Text OCR Extraction Tool built in Java with Tess4J, a JNA wrapper class for Tesseract.

License:Apache License 2.0


Languages

Language:Java 99.9%Language:Makefile 0.1%