Phan Hoang's repositories
Key information extraction from invoice documents with a Graph Convolutional Network
TensorFlow Serving with Docker / Docker Compose
MNIST embedding visualisation using TensorFlow Projector; blog link:
Let ChatGPT teach your own chatbot in hours with a single GPU!
Official PyTorch implementation of 6DRepNet: 6D rotation representation for unconstrained head pose estimation.
Official repository for the ChatIE paper and a tool for IE using ChatGPT. Note: we set a default OpenAI key. See the issues for a workaround for the gpt-3.5-turbo request limit. Response speed depends on OpenAI (sometimes the official API is too crowded, and the model will be slow or overloaded).
Official PyTorch implementation of "Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition"
Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks.
Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
An unofficial PyTorch implementation of ERNIE-Layout, which was originally released through PaddleNLP.
Official repo for FEAR: Fast, Efficient, Accurate and Robust Visual Tracker (ECCV 2022)
Official PyTorch implementation of Global Context Vision Transformers
Marrying Grounding DINO with Segment Anything - Detect and Segment Anything with Text Inputs
Scene text recognition
Holistically-Attracted Wireframe Parsing
Official PyTorch implementation of LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document Understanding (ACL 2022)
Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
A toolbox for skeleton-based action recognition.
Official PyTorch implementation of "SegNeXt: Rethinking Convolutional Attention Design for Semantic Segmentation" (NeurIPS 2022)
Source code for table generation
A method to increase the speed and lower the memory footprint of existing vision transformers.