Phan Hoang's repositories
kuzushiji_recognition
[Late Submission] Solution for Kuzushiji recognition (Kaggle competition)
Word2Vec_Recommender_System
[Tutorial] Applying the Word2Vec technique to recommender systems, a.k.a. Item2Vec or Prod2Vec
ALBEF
Code for ALBEF: a new vision-language pre-training method
Alpha-IoU
Alpha-IoU: A Family of Power Intersection over Union Losses for Bounding Box Regression
benchmarking-gnns
Repository for benchmarking graph neural networks
competitions
Solutions to Recommender Systems competitions
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
DensePhrases
ACL'2021: Learning Dense Representations of Phrases at Scale
docformer
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer-based architecture for Visual Document Understanding (VDU)
GNN-Recommender-Systems
An index of recommendation algorithms that are based on Graph Neural Networks.
GNN-RecSys
Graph Neural Networks for Recommender Systems
Graphormer
This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".
graphtransformer
Graph Transformer Architecture. Source code for "A Generalization of Transformer Networks to Graphs", DLG-AAAI'21.
image-to-latex
Convert images of LaTeX math equations into LaTeX code.
LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
MASTER-mmocr
Re-implementation of MASTER by mmocr
MASTER-TF
MASTER
meta-transfer-learning
TensorFlow and PyTorch implementation of "Meta-Transfer Learning for Few-Shot Learning" (CVPR 2019)
pren
Code for "Primitive Representation Learning for Scene Text Recognition" (CVPR 2021)
Representation-Learning-for-Information-Extraction
PyTorch implementation of the Google Research paper "Representation Learning for Information Extraction from Form-like Documents".
straug
Image transformations designed for Scene Text Recognition (STR) data augmentation. Published at ICCV 2021 Workshop on Interactive Labeling and Data Augmentation for Vision.
synthtiger
Official implementation of SynthTIGER (Synthetic Text Image GEneratoR) ICDAR 2021
TableMASTER-mmocr
2nd-place solution for the ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
Triton-TensorRT-Inference-CRAFT-pytorch
Advanced inference pipeline using NVIDIA Triton Inference Server for CRAFT text detection (PyTorch). Includes a PyTorch -> ONNX -> TensorRT converter and inference pipelines (TensorRT, Triton server, multi-format). Supported model formats for Triton inference: TensorRT engine, TorchScript, ONNX.
VisionLAN
A PyTorch implementation of "From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network" (ICCV 2021)