huyhoang17

Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)

Language:PythonMIT000

GNN-Recommender-Systems

An index of recommendation algorithms that are based on Graph Neural Networks.

010

GNN-RecSys

Graph Neural Networks for Recommender Systems

000

Graphormer

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

Language:PythonMIT010

graphtransformer

Graph Transformer Architecture. Source code for "A Generalization of Transformer Networks to Graphs", DLG-AAAI'21.

MIT000

image-to-latex

Convert images of LaTex math equations into LaTex code.

Language:PythonMIT000

LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Language:PythonMIT000

MASTER-mmocr

Re-implementation of MASTER by mmocr

Apache-2.0000

meta-transfer-learning

TensorFlow and PyTorch implementation of "Meta-Transfer Learning for Few-Shot Learning" (CVPR2019)

MIT000

nanodet

⚡Super fast and lightweight anchor-free object detection model. 🔥Only 980 KB(int8) / 1.8MB (fp16) and run 97FPS on cellphone🔥

Language:PythonApache-2.0010

pren

Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)

Language:PythonApache-2.0000

Representation-Learning-for-Information-Extraction

Pytorch implementation of Paper by Google Research - Representation Learning for Information Extraction from Form-like Documents.

Language:PythonApache-2.0000

straug

Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision.

Language:PythonApache-2.0000

synthtiger

Official implementation of SynthTIGER (Synthetic Text Image GEneratoR) ICDAR 2021

Language:PythonMIT000

TableMASTER-mmocr

2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.

Apache-2.0000

Triton-TensorRT-Inference-CRAFT-pytorch

Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT Text detection (Pytorch), included converter from Pytorch -> ONNX -> TensorRT, Inference pipelines (TensorRT, Triton server - multi-format). Supported model format for Triton inference: TensorRT engine, Torchscript, ONNX

Language:PythonBSD-3-Clause000

VisionLAN

A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV2021)

000

YOLOF

MIT000

huyhoang17

Phan Hoang's repositories

kuzushiji_recognition

framler

Word2Vec_Recommender_System

ABINet

ALBEF

Alpha-IoU

benchmarking-gnns

competitions

deep-text-recognition-benchmark

DensePhrases

docformer

GNN-Recommender-Systems

GNN-RecSys

Graphormer

graphtransformer

image-to-latex

LaTeX-OCR

MASTER-mmocr

MASTER-TF

meta-transfer-learning

nanodet

pren

Representation-Learning-for-Information-Extraction

spade

straug

synthtiger

TableMASTER-mmocr

Triton-TensorRT-Inference-CRAFT-pytorch

VisionLAN

YOLOF