Sihan Chen (TXH-mercury)

TXH-mercury

User data from Github https://github.com/TXH-mercury

Company:Institute of Automation, Chinese Academy of Sciences

Location:Beijing

GitHub:@TXH-mercury

Sihan Chen's repositories

VALOR

[TPAMI2024] Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Language:PythonLicense:MITStargazers:283Issues:11Issues:23

VAST

[NIPS2023] Code and Model for VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset

Language:Jupyter NotebookLicense:MITStargazers:272Issues:17Issues:27

COSA

[ICLR2024] Codes and Models for COSA: Concatenated Sample Pretrained Vision-Language Foundation Model

Language:PythonLicense:MITStargazers:43Issues:2Issues:4