sunny's repositories
awesome-ocr
A curated list of promising OCR resources
core
Read-only LibreOffice core repo - no pull request (use gerrit instead https://gerrit.libreoffice.org/) - don't download zip, use https://dev-www.libreoffice.org/bundles/ instead
dhSegment
Generic framework for historical document processing
DocumentLayoutAnalysis
Document Layout Analysis resources repos for development with PdfPig.
DuiLib_Ultimate
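Duilib Ultimate edition: high-DPI displays, multi-language support, style sheets, resource manager, irregularly shaped windows, window shadows, simple animations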
java-design-patterns
Design patterns implemented in Java
libharu
libharu - free PDF library
o2oa
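Open-source OA (office automation) system - Gitee GVP | Java open-source OA | enterprise OA office platform | enterprise OA | collaborative office OA | workflow platform OA | O2OA | OA; supports the Chinese domestic Kylin operating system and domestic databases (Dameng, Kingbase), government OA, defense-industry informatization OA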
ocrad.js
OCR in JavaScript via Emscripten
ocrsegment
A deep learning model for page layout analysis and segmentation.
ofdrw
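A library for reading and writing OFD fixed-layout documents, implemented according to GB/T 33190-2016 (the Chinese national standard for fixed-layout electronic document storage and exchange).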
okular
KDE document viewer
open-license-manager
An open license manager written in C++
pdf-annotate.js
Annotation layer for pdf.js (no longer maintained)
PDF-Explained
"PDF Explained" (Chinese title: 《PDF 解析》)
pdf2docx
Parse PDF file with PyMuPDF and generate docx with python-docx
pdfalto
PDF to XML ALTO file converter
pdffigures2
Given a scholarly PDF, extract figures, tables, captions, and section titles.
pdftitle
A utility to extract the title from a PDF file
proguard-with-maven-example
How to ProGuard with Apache Maven