There are 189 repositories under ocr topic.
Tesseract Open Source OCR Engine (main repository)
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
Pure Javascript OCR for more than 100 Languages 📖🎉🖥
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
A community-supported supercharged version of paperless: scan, index and archive all your physical documents
A privacy-first, self-hosted, fully open source personal knowledge management software, written in typescript and golang.
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
pix2tex: Using a ViT to convert images of equations into LaTeX code.
🌈一个跨平台的划词翻译和OCR软件 | A cross-platform software for text translation and recognition.
Scan, index, and archive all of your paper documents
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
Trained models with fast variant of the "best" LSTM models + legacy models
yolo3+ocr
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
A supercharged version of paperless: scan, index and archive all your physical documents
视频硬字幕提取,生成srt文件。无需申请第三方API,本地实现文本识别。基于深度学习的视频字幕提取框架,包含字幕区域检测、字幕内容提取。A GUI tool for extracting hard-coded subtitle (hardsub) from videos and generating srt files.
A Unified Toolkit for Deep Learning Based Document Image Analysis
TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile、desktop and server. TNN is distinguished by several outstanding features, including its cross-platform capability, high performance, model compression and code pruning. Based on ncnn and Rapidnet, TNN further strengthens the support and performance optimization for mobile devices, and also draws on the advantages of good extensibility and high performance from existed open source efforts. TNN has been deployed in multiple Apps from Tencent, such as Mobile QQ, Weishi, Pitu, etc. Contributions are welcome to work in collaborative with us and make TNN a better framework.
Translate manga/image 一键翻译各类图片内文字 https://cotrans.touhou.ai/
OpenMMLab Text Detection, Recognition and Understanding Toolbox
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
text detection mainly based on ctpn model in tensorflow, id card detect, connectionist text proposal network
A synthetic data generator for text recognition