There are 7 repositories under the document-analysis topic.
A curated list of resources on the Document Understanding (DU) topic
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
This repository provides training and testing code, a dataset, detection and recognition annotations, an evaluation script, an annotation tool, and a ranking.
Code for the paper "PICK: Processing Key Information Extraction from Documents using Improved Graph Learning-Convolutional Networks" (ICPR 2020)
Pandora is an analysis framework that determines whether a file is suspicious and conveniently displays the results
AssemblyLine 4: File triage and malware analysis
RObust document image BINarization
Local adaptive image binarization
Document Visual Question Answering
Post-process Amazon Textract results with Hugging Face transformer models for document understanding
(ICFHR 2020 oral) Code for "docExtractor: An off-the-shelf historical document element extraction" paper
Powerful web application that combines Streamlit, LangChain, and Pinecone to simplify document analysis. Powered by OpenAI's GPT-3, RAG enables dynamic, interactive document conversations, making it ideal for efficient document retrieval and summarization.
Effortlessly extract information from unstructured data with this library, which uses advanced AI techniques. Compose AI components into customizable pipelines over diverse sources for your projects.
An unofficial PyTorch implementation of "Lin et al. ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents. ICDAR, 2021"
Trained Detectron2 object detection models for document layout analysis based on PubLayNet dataset
Improving Document Binarization via Adversarial Noise-Texture Augmentation (ICIP 2019)
Code for ICPR2022 paper: "Graph Neural Networks and Representation Embedding for table extraction in PDF Documents"
UTRNet: High-Resolution Urdu Text Recognition In Printed Documents (ICDAR'23)
Deep learning models that take a document image file as input and locate paragraphs, lines, images, etc., along with their labels and confidence scores.
Enhanced Document Understanding on AWS delivers an easy-to-use web application that ingests and analyzes documents, extracts content, identifies and redacts sensitive customer information, and creates search indexes from the analyzed data.
[Late Submission] Solution for Kuzushiji recognition (Kaggle competition)
Official PyTorch implementation of PyramidTabNet: Transformer-based Table Recognition in Image-based Documents
Adobe CEP extension for InDesign to use the Bookalope cloud services.
A fast and accurate command line tool for extracting text from PDF files.
Java Client for the expert.ai Natural Language API
CVL/READ Modules including Basic Layout Analysis and Writer Identification/Retrieval
The code for MetaLDA in ICDM 2017
All the material (paper, code, dataset, results) of our DAS 2022 paper (OCR+NER benchmark)
Python code to perform keyword spotting using SIFT features
An advanced AI-powered generic document-analysis tool