ocr-python

There are 13 repositories under ocr-python topic.

Umi-OCR
hiroi-sora / Umi-OCR
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
paddleocr ocr ocr-python umi-ocr qml qt screenshot
Language:Python 37360
CnOCR
breezedeus / CnOCR
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. 【基于 PyTorch/MXNet 的中文/英文 OCR Python 包。】
chinese-character-recognition english-character-recognition ocr ocr-python pytorch
Language:Python 3495
CatchTheTornado / text-extract-api
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
anonymization api extract json llm ocr ocr-python pdf pii
Language:Python 2529
hiroi-sora / Umi-OCR_v2
结束和新的开始
ocr ocr-python paddleocr qml qt
Language:QML 946
Psarpei / Multi-Type-TD-TSR
Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition
image-processing deep-learning table-structure-recognition table-detection table-detection-using-deep-learning ocr ocr-recognition ocr-python natural-language-processing nlp nlp-machine-learning machine-learning machine-learning-algorithms algorithms computer-vision computer-science computer-vision-algorithms computer-vision-opencv
Language:Jupyter Notebook 281
maxent-ai / ocrpy
OCR, Archive, Index and Search: Implementation agnostic OCR framework.
aws azure cv google-vision-api information-retrieval nlp ocr ocr-python semantic-search tesseract-ocr transformers python computer-vision deep-learning image-processing
Language:Jupyter Notebook 223
MrZilinXiao / Hyper-Table-OCR
A carefully-designed OCR pipeline for universal boarded table recognition and reconstruction.
deep-learning ocr ocr-python table-extraction table-ocr
Language:C++ 177
nathanaday / RealTime-OCR
Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.
ocr ocr-python pytesseract cv2 opencv-python multithreading python
Language:Python 172
fast-plate-ocr
ankandrew / fast-plate-ocr
Lightweight & fast OCR models for license plate text recognition.
plate-recognition plate-ocr license-plate-recognition albumentations jax keras3 license-plate-reader onnx pytorch tensorflow license-plate-ocr keras ocr-python ocr license-plate-check license-plate
Language:Python 131
ilic5000 / pabkvizgenerator
Anansi is a computer vision (cv2 and FFmpeg) + OCR (EasyOCR and tesseract) python-based crawler for finding and extracting questions and correct answers from video files of popular TV game shows in the Balkan region.
computer-vision easyocr ocr-python opencv python quiz-app quiz-game tesseract
Language:Python 125
blueaxis / Cloe
Manga OCR snipping application for desktop
manga-ocr ocr ocr-python pyqt5 snipping-tool
Language:Python 113
prp-e / persian_ocr_project
A FLOSS software for Persian Optical Character Recognition
ocr ocr-python ocr-recognition
Language:Jupyter Notebook 89
nainiayoub / pdf-text-data-extractor
PDF text data extraction web app with OCR for scanned documents
pdf-to-text streamlit streamlit-webapp text-extraction python ocr ocr-python ocr-text-reader pdf
Language:Python 88
kartikgill / Easter2
Easter2.0: IMPROVING CONVOLUTIONAL MODELS FOR HANDWRITTEN TEXT RECOGNITION
handwriting-ocr handwriting-recognition handwritten-text-recognition htr iam-dataset ocr ocr-python optical-character-recognition python3 easter2
Language:Jupyter Notebook 79
shibing624 / imgocr
Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理，可实现 CPU 上毫秒级的 OCR 精准预测，通用场景中英文OCR达到开源SOTA。
chinese-ocr ocr ocr-python
Language:Python 73
tamil_ocr
gnana70 / tamil_ocr
OCR Tamil is a powerful tool that can detect and recognize text in Tamil images with high accuracy on Natural Scenes
indic-languages indic-scripts ocr optical-character-recognition python scene-text-detection scene-text-detection-recognition scene-text-recognition tamil tamil-language ocr-python tamil-ocr ocr-tamil ocr-recognition computer-vision natural-language-processing tamil-nlp transformer handwriting-recognition handwritten-text-recognition
Language:Python 71
ksasso1028 / EasyOCR-cpp
Custom C++ implementation of deep learning based OCR
cpp deployment easyocr inference inference-engine libtorch ocr ocr-python ocr-recognition ocr-text-reader optical-character-recognition text-detection text-recognition
Language:C++ 55
bentoml / BentoOCR
Turn any OCR models into online inference API endpoint 🚀 🌖
ocr ocr-python ai-applications model-deployment model-serving
Language:Python 54
genieincodebottle / parsemypdf
Collection of PDF parsing libraries like AI based docling, claude, openai, llama-vision, unstructured-io, and pdfminer, pymupdf, pdfplumber etc for efficient snapshot, text, table, and metadata extraction.
camelot claude docling llama-parse markitdown openai pymupdf pypdf unstructured-io llama-vision ocr ocr-python omniai smoldocling
Language:Python 54
X-T-E-R / my-little-ocr
MyLittleOCR 是一个统一的 OCR 库包装器，提供一致的 API，便于集成和切换多个 OCR 引擎。 MyLittleOCR is a unified OCR wrapper providing a consistent API for seamless integration and switching between multiple OCR engines.
easyocr ocr ocr-python paddleocr rapidocr surya tesseract wrapper mylittle
Language:Python 51
oidlabs-com / Lexoid
Multimodal document parser for high quality data understanding and extraction
llms pdf-document parser-library pdf-parser multimodal genai large-language-models ocr ocr-python
Language:Python 42
MauryaRitesh / OCR-Python
Optical Character Recognition in Python.
ocr-recognition ocr-python ocr python-ocr google google-images-downloader pytesseract tesseract-ocr tesseract-ocr-api tesseract-python
Language:Jupyter Notebook 41
sepehrraisi / Persian-OCR
A project to bring high accuracy OCR to Persian language.
ocr ocr-python ocr-recognition persian-ocr
Language:Shell 35
zefoy-captcha-solver
xtekky / zefoy-captcha-solver
Zefoy OCR captcha solver | 99% accurate
captcha captcha-solver ocr ocr-python ocr-recognition python python-3 zefoy
Language:Python 33
ASACHIT / OCR-django-app
A django webapp to scan text from image , faster, easy & efficient
django tailwindcss tailwind ocr-recognition ocr-python ocr-text-reader webapp webapplication
Language:CSS 28
sergiocorreia / quipucamayoc
dev repo for article
ocr ocr-post-processing ocr-python poppler table-extraction table-ocr textract
Language:Python 28
Baskar-forever / TableExtractor-Advanced-PDF-Table-Extraction
PDF Table Extractor is an innovative Python project designed to tackle the challenge of extracting tables from scanned PDF documents. Leveraging advanced optical character recognition (OCR) and image processing techniques.
ocr-python scanedpdf-extraction table-extraction table-extraction-python table-structure-recognition
Language:Jupyter Notebook 27
Unstructured-IO / community
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
community data-pipeline deep-learning document-ai document-parsing machine-learning nlp-parsing ocr-python open-source preprocessing-data
27
Employee-Monitoring-Using-Object-Detection
pgplarosa / Employee-Monitoring-Using-Object-Detection
Deep Learning Individual Project - March 03, 2022.
employee-management image-processing object-detection ocr-python yolov4 computer-vision pytesseract
Language:HTML 25
Jan-9C / deathcounter_ocr
A python script which detects death messages by using OCR and displays a corrosponding deathcounter. Preconfigured for Elden Ring
death-counter deathcounter gaming souls-games souls-like soulslike streaming twitch video-game videogame videogames ocr ocr-python ocr-recognition adaptable configurable image-processing elden-ring eldenring
Language:Python 22
FtmsdtHosseini / IDPL-PFOD
An Image Dataset of Printed Farsi Text for OCR Research
database dataset ocr ocr-recognition ocr-python image-classification image-processing text-processing image-generators image-generation python farsi-datasets farsi farsi-ocr persian-ocr persian-ocr-dataset farsi-ocr-dataset persian-dataset
21
ayseceyda / analog-meter-reading-openCV
AMR (automatic meter reading) project for analog meters, built with openCV+Python using basic OCR and image processing knowledge.
ocr-python opencv-python opencv image-processing amr python3 ocr analog meter keras-tensorflow cnn
Language:Jupyter Notebook 20
butlerlabs / docai
DocAI helps developers quickly build document, image and text processing pipelines using open source and cloud-based machine learning models for a wide range of applications
ocr python computer-vision information-extraction information-retrieval language-models machine-learning machine-learning-library natural-language-processing nlp nlp-library ocr-python pretrained-models
Language:Python 20
hungtooc / transaction_ocr
The open source extract transaction infomation by using OCR.
ocr-python google-ocr api python transaction ocr
Language:Python 20
yunwoong7 / korean_ocr_using_paddleOCR
This is a Korean OCR Python code using the paddleOCR library
ocr ocr-korean ocr-python ocr-recognition paddleocr python
Language:Jupyter Notebook 19
Hermann-web / python-OCR
Converting invoice pdf to image, image to text and then get, from the text, invoice informations like invoice number or vendor name
ocr tesseract pdf python ocr-recognition ocr-python ocr-text-reader image-to-text pdf-to-image invoice-pdf invoice-number
Language:Jupyter Notebook 18

ocr-python

hiroi-sora / Umi-OCR

breezedeus / CnOCR

CatchTheTornado / text-extract-api

hiroi-sora / Umi-OCR_v2

Psarpei / Multi-Type-TD-TSR

maxent-ai / ocrpy

MrZilinXiao / Hyper-Table-OCR

nathanaday / RealTime-OCR

ankandrew / fast-plate-ocr

ilic5000 / pabkvizgenerator

blueaxis / Cloe

prp-e / persian_ocr_project

nainiayoub / pdf-text-data-extractor

kartikgill / Easter2

shibing624 / imgocr

gnana70 / tamil_ocr

ksasso1028 / EasyOCR-cpp

bentoml / BentoOCR

genieincodebottle / parsemypdf

X-T-E-R / my-little-ocr

oidlabs-com / Lexoid

MauryaRitesh / OCR-Python

sepehrraisi / Persian-OCR

xtekky / zefoy-captcha-solver

ASACHIT / OCR-django-app

sergiocorreia / quipucamayoc

Baskar-forever / TableExtractor-Advanced-PDF-Table-Extraction

Unstructured-IO / community

pgplarosa / Employee-Monitoring-Using-Object-Detection

Jan-9C / deathcounter_ocr

FtmsdtHosseini / IDPL-PFOD

ayseceyda / analog-meter-reading-openCV

butlerlabs / docai

hungtooc / transaction_ocr

yunwoong7 / korean_ocr_using_paddleOCR

Hermann-web / python-OCR