This is a paper list for the multimodal dialogue models.
Keyword: Dialgue model, Multimodal dialogue, Natural Language Processing
Visual Dialog, CVPR 2017 [code]
OpenViDial: A Large-Scale, Open-Domain Dialogue Dataset with Visual Contexts, Arxiv 2020
Image-Chat: Engaging Grounded Conversations, ACL 2020
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations, ACL 2019 [code]
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog, NAACL 2019 [code]
Audio Visual Scene-Aware Dialog, CVPR 2019
CLEVR-Dialog: A Diagnostic Dataset for Multi-Round Reasoning in Visual Dialog, NAACL 2019
Talk the Walk: Navigating New York City through Grounded Dialogue, arXiv 2018
Game-Based Video-Context Dialogue, EMNLP 2018
Towards Building Large Scale Multimodal Domain-Aware Conversation Systems, arXiv 2017 [code]
All-in-one image-grounded conversa-tional agents, Arxiv 2020
Two Causal Principles for Improving Visual Dialog, CVPR 2020
Dialog-based Interactive Image Retrieval, NeurIPS 2018 [code]
Multi-modal open-domain dialogue, arxiv 2020
The Dialogue Dodecathlon: Open-Domain Knowledge and Image Grounded Conversational Agents, ACL2020
Knowledge-aware multi-modal dialogue systems, MM 2018
Image-Grounded Conversations: Multimodal Context for Natural Question and Response Generation, IJCNLP 2017
Visual Coreference Resolution in Visual Dialog using Neural Module Networks, ECCV 2018 [code]
Vision-and-Dialog Navigation, arXiv 2019 [code]
Multimodal Hierarchical Reinforcement Learning Policy for Task-Oriented Visual Dialog, SIGDIAL 2018
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods, JAIR 2020
Multimodal dialogue on social media, Social Semiotics 2018
Welcome to open issues or make pull requests!