There are 0 repository under multimodal-alignment topic.
[Reproduce] Code for the ACL2019 paper "Multimodal Transformer for Unaligned Multimodal Language Sequences".
A generalized self-supervised training paradigm for unimodal and multimodal alignment and fusion.
Multimodal alignment of images and point clouds on the Modelnet-40-C dataset
Using a 3D Nearby Self-Attention Transformer to leverage the spatiotemporal nature of video for representation learning.