Huy Q Can's repositories
coursera-deep-learning-specialization
Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv) Convolutional Neural Networks; (v) Sequence Models
receipt-kie
key information extraction for receipt for fun =))
Awesome-Document-Image-Rectification
A comprehensive list of awesome document image rectification papers.
awesome-Face_Recognition
papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face Deblurring; Face Generation && Face Synthesis; Face Transfer; Face Anti-Spoofing; Face Retrieval;
Awesome-Table-Recognition
A curated list of resources dedicated to table recognition
data-science-interviews
Data science interview questions and answers
deep-learning-drizzle
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
doc3D-dataset
A hybrid dataset for document unwarping (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)
docile
DocILE: Document Information Localization and Extraction Benchmark
ERNIE-Layout-Pytorch
An unofficial Pytorch implementation of ERNIE-Layout which is originally released through PaddleNLP.
FAST
Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
MATRN
Official PyTorch implementation for Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features (MATRN) in ECCV 2022.
MC-OCR
The task aims at extracting required fields in receipts captured by mobile devices :smile:
PraNet-Neoplasm-Detection
PraNet: Parallel Reverse Attention Network for Polyp Segmentation, MICCAI 2020 (Oral). Code using Jittor Framework is available.
ResNext-imagewoof
implement resnext
Speech-and-Language-Processing
note something while reading Speech and Language Processing third edition.
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
vietocr
Transformer OCR
Violence-detection-project
Computer vision project