Please see here(https://yuanbo2020.github.io/Contrastive-AVSC/) to interactively view the label for each point in detail.
Please consider citing our paper as
@inproceedings{hou2022audio,
author={Hou, Yuanbo and Kang, Bo and Botteldooren, Dick},
title={{Audio-visual scene classification via contrastive event-object alignment and semantic-based fusion}},
year=2022,
booktitle={2022 IEEE 24rd International Workshop on Multimedia Signal Processing (MMSP)},
pages={1--6},
}