User data from GitHub: https://github.com/TXH-mercury
Company: Institute of Automation, Chinese Academy of Sciences
Location: Beijing
GitHub: @TXH-mercury
[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset
[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset
[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model