There are 0 repository under multi-modal-deep-learning topic.
This repository contains the source code for our paper: "Husformer: A Multi-Modal Transformer for Multi-Modal Human State Recognition". For more details, please refer to our paper at https://arxiv.org/abs/2209.15182.
The largest multilingual image-text classification dataset. It contains fashion products.
Official repository of IEEE Research Paper "Chronic Obstructive Pulmonary Disease Severity Classification using lung sound"