Yaoting Wang's repositories
intermodal_incongruity
For ACL Rolling Review
AudioCLIP
Source code for models described in the paper "AudioCLIP: Extending CLIP to Image, Text and Audio" (https://arxiv.org/abs/2106.13043)
CMU-MultimodalSDK
CMU MultimodalSDK is a machine learning platform for development of advanced multimodal models as well as easily accessing and processing multimodal datasets.
COVID-19
Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
cs4084example
Example project for CS4084 Mobile Application Development
Deep-Reinforcement-Learning
CS4287-Project-03
Generalizable-Audio-Visual-Segmentation
Author version - Official repository of "Prompting Segmentation with Sound is Generalizable Audio-Visual Source Localizer", AAAI 2024
llama
Inference code for LLaMA models
mmt
[ACL'19] [PyTorch] Multimodal Transformer
nEMO
nEmo_speech_emotion_dataset
PRJ
项目报告总结
ProbRobScene
ProbRobScene: A Probabilistic Specification Language for 3D Robotic Manipulation Environments
Project_Report_Overall
项目报告总结
SAM_annotation_tool
A simple semi-automatic labelling tool for semantic segmention masks using SAM as support.
speechrecorder
Basic Python SpeechRecorder clone