Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
The official code for “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction”, ACM MM, Oral Paper, 2021.
Unofficial PyTorch Implementation of "DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features"
Source code of "EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers"
Source code for GNN-LSPE (Graph Neural Networks with Learnable Structural and Positional Representations), ICLR 2022
Representing Long-Range Context for Graph Neural Networks with Global Attention
Scene text recognition
HybridNets: End-to-End Perception Network
Code for "LoFTR: Detector-Free Local Feature Matching with Transformers", CVPR 2021
Code for the paper "MASTER: Multi-Aspect Non-local Network for Scene Text Recognition" (Pattern Recognition 2021)
Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features (MATRN).
Pytorch implementation of "M-LSD: Towards Light-weight and Real-time Line Segment Detection"
The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
Implementation of Principal Neighbourhood Aggregation for Graph Neural Networks in PyTorch, DGL and PyTorch Geometric
SOTA Semantic Segmentation Models in PyTorch
source code for Table Generation
Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at SDU@AAAI-22
Vision Transformer Cookbook with Tensorflow
The first competitive instance segmentation approach that runs on small edge devices at real-time speeds.
You Only Look at One Sequence (NeurIPS 2021)