Khanh Nguyen's repositories
wikipedia_captioning
Show, Interpret & Tell - AAAI 2023
AoANet
Code for paper "Attention on Attention for Image Captioning". ICCV 2019
Language: Python
License: MIT
LaBERT
A length-controllable and non-autoregressive image captioning model.
Language: Python
M5_VisualRecognition
M5 Visual recognition Group 7
Language: Python
MCV_M5_VisualRecognition
MCV 2020 - M5 Project: Object Detection and Segmentation - Group 6
Language: Java
transform-and-tell
[CVPR 2020] Transform and Tell: Entity-Aware News Image Captioning
Language: Python
ViLT
Code for the ICML 2021 (long talk) paper: "ViLT: Vision-and-Language Transformer Without Convolution or Region Supervision"
Language: Python
License: Apache-2.0
DiT
Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"
License: NOASSERTION
Language: Python
License: Apache-2.0