Phan Hoang's repositories
KIE_invoice_minimal
Key information extraction from invoice document with Graph Convolution Network
MiniGemini
Official implementation for Mini-Gemini
ChatIE
official repository for ChatIE paper and a tool of IE using ChatGPT. Note: we set defaul openai key. See issues for the solution of gpt3.5-turbo request limit. The response speed depends on openai. ( sometimes, the official is too crowded and the speed/model will be slow/overloaded.)
DocRes
[CVPR 2024] DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks
EdgeFormer
Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
ERNIE-Layout-Pytorch
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
GTR
Scene text recognition
lightning-pose
Accelerated pose estimation and tracking using semi-supervised convolutional networks.
LLaMA-Adapter
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
MRN
MRN: Multiplexed Routing Network for Incremental Multilingual Text Recognition (ICCV 2023)
nougat
Implementation of Nougat Neural Optical Understanding for Academic Documents
nougat-latex-ocr
Codes for fine-tuning / evaluating nougat-based image2latex generation models
python-mastery
Advanced Python Mastery (course by @dabeaz)
segment-anything-fast
A batched offline inference oriented version of segment-anything
SEMv3
The official PyTorch implementation of SEMv3.
StructEqTable-Deploy
A High-efficiency Open-source Toolkit for Table-to-Latex Task
UniMERNet
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
Union14M
[ICCV 2023] Code base for Revisiting Scene Text Recognition: A Data Perspective
unitable
UniTable: Towards a Unified Table Foundation Model
UVDoc
Code for the paper "UVDoc: Neural Grid-based Document Unwarping"
VLM-R1
Solve Visual Understanding with Reinforced VLMs
yolo_tracking
A collection of SOTA real-time, multi-object tracking algorithms for object detectors
yolov5_obb
yolov5 + csl_label.(Oriented Object Detection)(Rotation Detection)(Rotated BBox)基于yolov5的旋转目标检测