Esra's repositories
Dialogue-Act-Classification
A multimodal dialogue act classifier
Structural-Priors-in-VQA
A survey investigating the concept space prior in Vision and Language models
VisLang-Paper-Club
Reading group for Vision and Language research
awesome-grounding
awesome grounding: A curated list of research papers in visual grounding
cfvqa
[CVPR 2021] Counterfactual VQA: A Cause-Effect Look at Language Bias
counter
Counterfactual Explainable Recommendation
CSS-VQA
Counterfactual Samples Synthesizing for Robust VQA
GSMN
Implementation of our CVPR2020 paper, Graph Structured Network for Image-Text Matching
handcalcs
Python library for converting Python calculations into rendered latex.
IMS-Toucan
IMS-Toucan is a toolkit to train state-of-the-art Speech Synthesis models. Everything is pure Python and PyTorch based to keep it as simple and beginner-friendly, yet powerful as possible.
L2R2
PyTorch implementation of L2R2 in SIGIR 2020
lcgn
Code release for Hu et al., Language-Conditioned Graph Networks for Relational Reasoning. in ICCV, 2019
mcan-vqa
Deep Modular Co-Attention Networks for Visual Question Answering
merlot
MERLOT: Multimodal Neural Script Knowledge Models
OpenPrompt
An Open-Source Framework for Prompt-Learning.
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
reinforcement_learning_an_introduction
Notes and exercise solutions for second edition of Sutton & Barto's book
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
visdial_conv
This repository contains code used in our ACL'20 paper History for Visual Dialog: Do we really need it?