Chuang Gan's repositories
Foley-Music
PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "
find_fallen_objects
Official implementation of CVPR 2022 paper "Finding Fallen Objects Via Asynchronous Audio-Visual Integration".
imageqa-san
code for Stacked attention networks for image question answering
SCN_for_video_captioning
Using Semantic Compositional Networks for Video Captioning
Semantic_Compositional_Nets
The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"
SfMLearner
An unsupervised learning framework for depth and ego-motion estimation from monocular videos
stylenet-1
A pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"
TensorFlow-Tutorials
Simple tutorials using Google's TensorFlow Framework
Youtube-8M
PaddlePaddle models for Youtube-8M Video Understanding Challenge
mesh-transformer-jax
Model parallel transformers in JAX and Haiku
sound-spaces
A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.