data's repositories
VQA-MIB
Code For Paper "Model-agnostic information biasing for VQA" CODS-COMAD 2021
LSCM-Refseg
Code for Linguistic Structure Guided Context Modeling for Referring Image Segmentation, ECCV2020.
SSL-VQA
Code for our IJCAI2020 paper: Overcoming Language Priors with Self-supervised Learning for Visual Question Answering
CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
mfas
Implementation of CVPR 2019 paper "Mfas: Multimodal fusion architecture search"
visdial-bert
Implementation for "Large-scale Pretraining for Visual Dialog" https://arxiv.org/abs/1912.02379
basic_vqa
Pytorch VQA : Visual Question Answering (https://arxiv.org/pdf/1505.00468.pdf)
ControlGAN-Tensorflow
Simple Tensorflow implementation of "ControlGAN: Controllable Text-to-Image Generation" (NeurIPS 2019)
TVQAplus
[ACL 2020] PyTorch code for TVQA+: Spatio-Temporal Grounding for Video Question Answering
murel.bootstrap.pytorch
MUREL (CVPR 2019), a multimodal relational reasoning module for VQA
Image-Captioning-PyTorch
图像中文描述+视觉注意力
vqa.mutan
Visual Question Answering in Pytorch
ATTNGANwithBERT
Implementation of a text to image generator in ATTNGAN paper improved using BERT transformer
a-PyTorch-Project-to-Image-Caption
Image Caption with Attention | a PyTorch Project to Image Caption
Image-Captioning
CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image caption or image to text on MS-COCO dataset. 图片描述
Deep-Convolutional-Generative-Adversarial-Network
Tensorflow 2. This repository demonstrates how to generate images of handwritten digits (MINIST) using a Deep Convolutional Generative Adversarial Network (DCGAN). 深度卷积生成对抗网络
Image-and-Text-Search
Joint representation of image and text through a Canonical Correlation Analysis
vqa-project
Code for our paper: Learning Conditioned Graph Structures for Interpretable Visual Question Answering
Visual-Question-Answering
This is an PyTorch implementation of DMN+ model on MSCOCO VQA dataset.
text-to-image-using-GAN
This repository consists of code that is used to convert text-embeddings into Images using Generative Adversarial Networks(GAN)
PyTorch-FastCampus
PyTorch로 시작하는 딥러닝 입문 CAMP (2017.7~2017.12) 강의자료
Relation-Network-Tensorflow
Tensorflow implementations of Relational Networks and a VQA dataset named Sort-of-CLEVR proposed by DeepMind.
VizWiz-VQA-PyTorch
PyTorch VQA implementation that achieved top performances in the (ECCV18) VizWiz Grand Challenge: Answering Visual Questions from Blind People
image-captioning-3
Image captioning models "show and tell" + "show, attend and tell" in PyTorch
irlc-vqa-counting
Code for Interpretable Counting for Visual Question Answering for ICLR 2018 reproducibility challenge.
iQAN
Visaul Question Generation as Dual Task of Visual Question Answering (PyTorch Version)