Chuang Gan's repositories

CLEVRER

PyTorch implementation of ICLR 2020 paper "CLEVRER: CoLlision Events for Video REpresentation and Reasoning"

Foley-Music

PyTorch implementation of ECCV 2020 paper "Foley Music: Learning to Generate Music from Videos "

find_fallen_objects

Official implementation of CVPR 2022 paper "Finding Fallen Objects Via Asynchronous Audio-Visual Integration".

Language:PythonLicense:MITStargazers:6Issues:2Issues:1

GAT

Graph Attention Networks (https://arxiv.org/abs/1710.10903)

Language:PythonLicense:MITStargazers:1Issues:1Issues:0

imageqa-san

code for Stacked attention networks for image question answering

Language:PythonStargazers:1Issues:0Issues:0

rnn

Recurrent Neural Network library for Torch7's nn

Language:LuaLicense:BSD-3-ClauseStargazers:1Issues:0Issues:0

SCN_for_video_captioning

Using Semantic Compositional Networks for Video Captioning

Language:PythonStargazers:1Issues:1Issues:0

Semantic_Compositional_Nets

The Theano code for the CVPR 2017 paper "Semantic Compositional Networks for Visual Captioning"

Language:PythonStargazers:1Issues:1Issues:0

SfMLearner

An unsupervised learning framework for depth and ego-motion estimation from monocular videos

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

stylenet-1

A pytorch implemention of "StyleNet: Generating Attractive Visual Captions with Styles"

Language:PythonStargazers:1Issues:1Issues:0

TensorFlow-Tutorials

Simple tutorials using Google's TensorFlow Framework

Language:Jupyter NotebookStargazers:1Issues:0Issues:0

tvnet

End-to-End Learning of Motion Representation for Video Understanding

Language:PythonStargazers:1Issues:1Issues:0

Youtube-8M

PaddlePaddle models for Youtube-8M Video Understanding Challenge

Language:PythonLicense:Apache-2.0Stargazers:1Issues:1Issues:0

mesh-transformer-jax

Model parallel transformers in JAX and Haiku

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

sound-spaces

A first-of-its-kind acoustic simulation platform for audio-visual embodied AI research. It supports training and evaluating multiple tasks and applications.

Language:PythonLicense:CC-BY-4.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

vqs

VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation

Language:PythonStargazers:0Issues:1Issues:0