So Uchida's starred repositories
deepdoctection
A Repo For Document AI
calibration-framework
The net:cal calibration framework is a Python 3 library for measuring and mitigating miscalibration of uncertainty estimates, e.g., by a neural network.
vscode-extension-samples
Sample code illustrating the VS Code extension API.
ElasticDiffusion-official
The official Pytorch Implementation for ElasticDiffusion: Training-free Arbitrary Size Image Generation (CVPR 2024)
101-People-4538-Images-Japanese-Handwriting-OCR-Data
Japanese Handwriting OCR Dataset
Handwriting-Transformers
Handwriting-Transformers (ICCV21)
convolutional-handwriting-gan
ScrabbleGAN: Semi-Supervised Varying Length Handwritten Text Generation (CVPR20)
DocTr-Plus
The official code for “Deep Unrestricted Document Image Rectification”, TMM, 2023.
small-object-detection-benchmark
icip2022 paper: sahi benchmark on visdrone and xview datasets using fcos, vfnet and tood detectors
sahi_batched
Sahi batched inference (Yolov8 only)
vrdu
We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datasets that represent several challenges: rich schema including diverse data types, complex templates, and diversity of layouts within a single document type.
groundingLMM
[CVPR 2024 🔥] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
GPT-4V_OCR
Evaluation of the Optical Character Recognition (OCR) capabilities of GPT-4V(ision)