duydedai's repositories
ALBEF
Code for ALBEF: a new vision-language pre-training method
bottom-up-attention.pytorch
A PyTorch reimplementation of bottom-up-attention models
HADA-LAVIS
A sub-repository of HADA in which HADA is applied on top of LAVIS
CRT_Applying_RF
A simple project that applies random forest to some basic datasets and compares the results with a neural network
CSL_HCMC
Repository for the CityScope project related to Ho Chi Minh City Collaboration
ctpn.pytorch
PyTorch implementation of CTPN (Detecting Text in Natural Image with Connectionist Text Proposal Network)
deep-text-recognition-benchmark
PyTorch code of my ICDAR 2021 paper Vision Transformer for Fast and Efficient Scene Text Recognition (ViTSTR)
DPT
Dense Prediction Transformers
EE514-FakeNews-Detection
EE514 Assignment
EfficientNetV2-pytorch
EfficientNetV2 PyTorch (PyTorch Lightning) implementation with pretrained model
Interview-Assignment
This project uses MiniZinc to solve the constraint problem of the interview assignment process
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
LightningDOT
Source code and pre-trained/fine-tuned checkpoint for the NAACL 2021 paper LightningDOT
monodepth2
[ICCV 2019] Monocular depth estimation from a single image
MPViT
MPViT: Multi-Path Vision Transformer for Dense Prediction
pretty-print-confusion-matrix
Confusion Matrix in Python: plot a pretty confusion matrix (like Matlab) in Python using seaborn and matplotlib
research-charnet
CharNet: Convolutional Character Networks
SAGPool
Official PyTorch Implementation of SAGPool - ICML 2019
Scene-Graph-Benchmark.pytorch
A new codebase for Scene Graph Generation based on maskrcnn-benchmark. A PyTorch implementation of the CVPR 2020 Oral paper "Unbiased Scene Graph Generation from Biased Training"
SGRAF
The code of “Similarity Reasoning and Filtration for Image-Text Matching” [AAAI2021]
symspellpy
Python port of SymSpell
vedastr
A scene text recognition toolbox based on PyTorch
X-VLM
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)