There are 10 repositories under the layout-analysis topic.
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.
A Unified Toolkit for Deep Learning Based Document Image Analysis
An Open-Source Python3 tool with SMALL models for recognizing layouts, tables, math formulas (LaTeX), and text in images, converting them into Markdown format. A free alternative to Mathpix, empowering seamless conversion of visual content into text-based representations. 80+ languages are supported.
OCR engine for all the languages
Document Layout Analysis resources and repositories for development with PdfPig.
A toolbox of OCR models and algorithms based on MindSpore
Layout analysis for Chinese and English documents
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
YOLO models trained on DocLayNet: power your document intelligence with layout analysis
An official implementation of the paper "Paragraph2Graph: A Language-independent GNN-based framework for layout analysis"
Trained Detectron2 object detection models for document layout analysis based on the PubLayNet dataset
[ICDAR 2023] SelfDocSeg: A self-supervised vision-based approach towards Document Segmentation (Oral)
A Large Dataset of Historical Japanese Documents with Complex Layouts
A Unified Toolkit for Deep Learning-Based Table Extraction
Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block in a PDF document page as one of title, text, list, table, or image.
Chinese layout detection implemented with java-yolov8: java-yolov8 is used to detect the layout of Chinese document images
This library builds a graph representation of the content of PDFs. The graph is then clustered, and the resulting page segments are classified and returned. Tables are returned formatted as CSV.
Nordrassil is a keyboard layout that provides an elegant and balanced typing experience by its use of a thumb-alpha, emphasis on middle fingers, deprioritisation of pinkies, and repeat key (or arcane keys).
A more complete example of programming with PDFMiner, which continues where the default documentation stops
A powerful CLI tool for visualization and encoding of PAGE-XML files
A web application for PDF content and table extraction, featuring image-based visual layout analysis, indexed document search, batch processing and extraction result annotation.
Open Dataset for the Recognition and Analysis of Scripts in Arabic Maghrebi (ICDAR 2021, CHR 2024)
A Python + C implementation for image-based PDF page layout analysis and content extraction.
HTR ground truth of the Chi-Know-Po project (Collex Persée)
A Python package to structure files using visual and style information
Layout Parser notebook Implementation & Re-trained model for Image detection and extraction
Raw data of the Catalog of Armenian Manuscripts of Venice
Main repository of the CGPG project for OCR and Text Analysis of the Patrologia Graeca
Live capture your screen and replace textual elements with their translations
Automated Election Vote Counting