MLT

Multimodal Lexical Translation Dataset:

A dataset of ambiguous words and their lexical translations, together with visual and textual contexts (i.e. an image and a sentence, respectively)

Format

English_Word | Lexical_Translation | Textual_Context (A sentence) | Visual_Context (Image id)
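
A minimal sketch of reading one line in this format. It assumes that fields are separated by a literal "|" character with optional surrounding whitespace and that the sentence itself contains no "|"; check the actual files before relying on this. The names MLTEntry and parse_line are hypothetical and not part of the dataset, and the example line is made up for illustration.

```python
from typing import NamedTuple


class MLTEntry(NamedTuple):
    """One dataset row (hypothetical container, field names follow the format line)."""
    english_word: str
    lexical_translation: str
    textual_context: str  # the sentence
    visual_context: str   # the image id, kept as a string


def parse_line(line: str) -> MLTEntry:
    # Assumption: "|" is the column delimiter and never occurs inside a field.
    fields = [field.strip() for field in line.rstrip("\n").split("|")]
    if len(fields) != 4:
        raise ValueError(f"expected 4 fields, got {len(fields)}: {line!r}")
    return MLTEntry(*fields)


# Made-up example line, not taken from the dataset.
entry = parse_line("bank | Ufer | A man sits on the river bank . | 12345")
print(entry.english_word, entry.lexical_translation, entry.visual_context)
```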

The human annotations of the 2018 test set are saved in the files human.de and human.fr

These are in the same format as above, with an extra column in which the human annotators indicated whether the image was used.

English_Word | Lexical_Translations | Textual_Context (A sentence) | Visual_Context (Image id) | Was Image used? (yes/no)
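
A minimal sketch of reading the five-column annotation format and computing how often annotators reported using the image. It assumes a literal "|" delimiter and that the last column holds exactly "yes" or "no" (compared case-insensitively). The functions parse_human_line and image_usage_rate are hypothetical helpers, and the sample lines are made up for illustration.

```python
from typing import Iterable


def parse_human_line(line: str) -> dict:
    # Assumption: "|" is the column delimiter and never occurs inside a field.
    fields = [field.strip() for field in line.rstrip("\n").split("|")]
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}: {line!r}")
    word, translations, sentence, image_id, used = fields
    flag = used.lower()
    if flag not in ("yes", "no"):
        raise ValueError(f"expected yes/no in last column, got {used!r}")
    return {
        "english_word": word,
        "lexical_translations": translations,
        "textual_context": sentence,
        "visual_context": image_id,
        "image_used": flag == "yes",
    }


def image_usage_rate(lines: Iterable[str]) -> float:
    """Fraction of non-empty lines whose last column is "yes"."""
    rows = [parse_human_line(line) for line in lines if line.strip()]
    if not rows:
        return 0.0
    return sum(row["image_used"] for row in rows) / len(rows)


# Made-up sample lines, not taken from human.de or human.fr.
sample = [
    "bank | Ufer | A man sits on the river bank . | 12345 | yes",
    "bank | Bank | She walks into the bank . | 67890 | no",
]
rate = image_usage_rate(sample)
print(rate)
```

To run this on the real annotations, pass an open file instead of the sample list, for example open("human.de", encoding="utf-8"); the UTF-8 encoding is an assumption.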