ocr

There are 233 repositories under the ocr topic.

tesseract-ocr / tesseract
Tesseract Open Source OCR Engine (main repository)
hacktoberfest lstm machine-learning ocr ocr-engine tesseract tesseract-ocr
Language:C++ 63353
PaddlePaddle / PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (a practical, ultra-lightweight OCR system that supports recognition of 80+ languages, provides data annotation and synthesis tools, and supports training and deployment on server, mobile, embedded, and IoT devices)
chineseocr crnn db ocr ocrlite
Language:Python 45269
naptha / tesseract.js
Pure JavaScript OCR for more than 100 languages 📖🎉🖥
deep-learning javascript ocr tesseract webassembly
Language:JavaScript 35595
ShareX / ShareX
ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from.
capture color-picker csharp dropbox file-sharing file-upload ftp gif gif-recorder image-annotation imgur ocr productivity region-capture screen-capture screen-recorder screenshot share sharex url-shortener
Language:C# 30208
hiroi-sora / Umi-OCR
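Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning and generating QR codes. Includes built-in language packs for multiple languages.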
ocr ocr-python paddleocr qml qt screenshot umi-ocr
Language:Python 28166
JaidedAI / EasyOCR
Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, etc.
cnn crnn data-mining deep-learning easyocr image-processing information-retrieval lstm machine-learning ocr optical-character-recognition python pytorch scene-text scene-text-recognition
Language:Python 24971
siyuan-note / siyuan
Privacy-first, self-hosted, fully open source personal knowledge management software, written in TypeScript and Golang.
note-taking pkm local-first knowledge-base markdown s3 ocr chatgpt openai notion obsidian evernote pdf notebook webdav self-hosted anki notes-app electron
Language:TypeScript 23903
paperless-ngx / paperless-ngx
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
angular archiving django dms document-management document-management-system machine-learning ocr optical-character-recognition pdf
Language:Python 23264
opendatalab / MinerU
A one-stop, open-source, high-quality data extraction tool for converting PDF to Markdown and JSON.
ai4science document-analysis extract-data layout-analysis ocr parser pdf pdf-converter pdf-extractor-llm pdf-extractor-pretrain pdf-extractor-rag pdf-parser python
Language:Python 22000
ocrmypdf / OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
image-processing ocr pdf python tesseract
Language:Python 14443
lukas-blecher / LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
dataset deep-learning im2latex im2markup im2text image-processing image2text latex latex-ocr machine-learning math-ocr ocr python pytorch transformer vision-transformer vit
Language:Python 13144
DayBreak-u / chineseocr_lite
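Ultra-lightweight Chinese OCR with support for vertical text recognition and for ncnn, mnn, and tnn inference (dbnet (1.8M) + crnn (2.5M) + anglenet (378KB)); the total model size is only 4.7M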
ncnn ocr pytorch
Language:C++ 11867
pot-app / pot-desktop
🌈 Cross-platform software for text-selection translation and OCR.
linux macos ocr pot pot-app recognize tauri translate translation tts windows
Language:JavaScript 10899
sml2h3 / ddddocr
ddddocr: general-purpose CAPTCHA recognition OCR, PyPI version
captcha ddddocr ocr
Language:Python 10697
Unstructured-IO / unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
data-pipelines deep-learning document-image-analysis document-image-processing document-parser document-parsing docx donut information-retrieval langchain llm machine-learning ml natural-language-processing nlp ocr pdf pdf-to-json pdf-to-text preprocessing
Language:HTML 9560
dataelement / bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
agent ai chatbot enterprise finetune genai gpt langchian llama llm llmdevops llmops ocr openai orchestration python rag react sft workflow
Language:Python 9067
ripperhe / Bob
Bob is a translation and OCR application for macOS.
bobapp chatgpt deepseek doubao ernie gemini groq hunyuan kimi macos ocr openai qwen translate translation translator zhipuai
9040
the-paperless-project / paperless
Scan, index, and archive all of your paper documents
archiving documents ocr paper search
Language:Python 7865
microsoft / ailab
Experience, Learn and Code the latest breakthrough innovations with Microsoft AI
ai algorithms azure-functions bing-search bot computer-vision csharp custom-vision dnn html5 image-classification iot javascript language-learning luis object-detection ocr translation
Language:C# 7741
tisfeng / Easydict
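A concise and elegant dictionary and translator app for macOS, for looking up words and translating text. Works out of the box, supports offline OCR, and supports Youdao Dictionary, 🍎 Apple System Dictionary, 🍎 Apple System Translation, OpenAI, Gemini, DeepL, Google, Bing, Tencent, Baidu, Alibaba, Niutrans, Caiyun, and Volcano Translation.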
app baidu bing deepl dictionary gemini google macos ocr openai shortcuts tencent translate translator youdao
Language:Objective-C 7695
getomni-ai / zerox
PDF to Markdown with vision models
ocr pdf
Language:Python 7241
tesseract-ocr / tessdata
Trained models with fast variant of the "best" LSTM models + legacy models
ocr tesseract
6546
YaoFANGUK / video-subtitle-extractor
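A GUI tool for extracting hard-coded subtitles (hardsubs) from videos and generating srt files. No third-party API is required; text recognition runs locally. A deep-learning-based video subtitle extraction framework that includes subtitle region detection and subtitle content extraction.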
deep-learning ocr subtitles srt hardsub extract ripper subrip
Language:Python 6348
pymupdf / PyMuPDF
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
data-science epub extract-data font mupdf ocr pdf pdf-documents pymupdf python table-extraction tesseract text-processing text-shaping xps
Language:Python 6049
Swift-AI / Swift-AI
The Swift machine learning library.
artificial-intelligence deep-learning ios machine-learning macos ocr swift
Language:Swift 6031
chineseocr / chineseocr
yolo3+ocr
yolo3 chinese-text-detect chinese-ocr opencv-dnn darknet-text-detect idcard trainticket ocr
Language:Python 5966
clovaai / donut
Official Implementation of OCR-free Document Understanding Transformer (Donut) and Synthetic Document Generator (SynthDoG), ECCV 2022
computer-vision document-ai eccv-2022 multimodal-pre-trained-model nlp ocr
Language:Python 5927
adithya-s-k / omniparse
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
ingestion-api ocr omniparser parse-server parser-library vision-transformer web-crawler whisper-api
Language:Python 5903
axa-group / Parsr
Transforms PDF, Documents and Images into Enriched Structured Data
data document extraction hacktoberfest images nlp ocr parsr pdf python typescript
Language:JavaScript 5879
zyddnys / manga-image-translator
Translate manga/images: one-click translation of text in all kinds of images https://cotrans.touhou.ai/
anime auto-translation chinese-translation deep-learning image-processing inpainting japanese-translations machine-translation manga neural-network ocr pytorch-implementation text-detection text-detection-recognition transformer
Language:Python 5579
jonaswinkler / paperless-ng
A supercharged version of paperless: scan, index and archive all your physical documents
angular archiving django dms document-management-system full-text-search machine-learning ocr search
Language:Python 5380
xushengfeng / eSearch
Screenshot, offline OCR, search, translation, reverse image search, pinning images on screen, screen recording, omnidirectional scrolling screenshots, and screen translation
clipboard color-picker cross-platform electron image-editing image-editor live-text ocr paddleocr screen-capture screen-recorder screenshot search search-photos
Language:TypeScript 5141
PaddlePaddle / PaddleX
All-in-One Development Tool based on PaddlePaddle (PaddlePaddle low-code development tool)
classification segmentation deployment ocr time-series pp-chatocr ai-pipelines object-detection
Language:Python 4997
Layout-Parser / layout-parser
A Unified Toolkit for Deep Learning Based Document Image Analysis
computer-vision deep-learning detectron2 document-image-processing document-layout-analysis layout-analysis layout-detection layout-parser object-detection ocr
Language:Python 4972
NMAC427 / SwiftOCR
Fast and simple OCR library written in Swift
ocr swift ocr-library optical-character-recognition ocr-engine ios macos swiftocr deprecated
Language:Swift 4621
Tencent / TNN
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop, and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression, and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the good extensibility and high performance of existing open source efforts. TNN has been deployed in multiple apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to collaborate with us and make TNN a better framework.
coreml deep-learning face-detection hairsegmentaion inference mnn ncnn ocr openvino pytorch tengine tensorflow tensorrt
Language:C++ 4438
