There are 0 repositories under the multimodal-machine-learning topic.
Reading list for Multimodal Large Language Models
MIntRec 2.0 is the first large-scale dataset for multimodal intent recognition and out-of-scope detection in multi-party conversations (ICLR 2024)
Multimodal datasets for machine learning
This repository is cloned from https://github.com/HLR/LatentAlignmentProcedural. It explores a potential baseline for the textual_cloze task on the RecipeQA dataset (https://hucvl.github.io/recipeqa/).