NjuHaoZhang / awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Awesome Audio-Visual

A curated list of papers and datasets for various audio-visual tasks, inspired by awesome-computer-vision.

Contents

Audio-Visual Localization

Audio-Visual Separation

Audio-Visual Representation/Classification

Audio-Visual Action Recognition

Audio-Visual Spatial/Depth

Audio-Visual Navigation/RL

Audio-Visual Faces/Speech

Cross-modal Generation (Audio-Video / Video-Audio)

Multi-modal Architectures

Uncategorized Papers

Datasets

General Audio-Visual Tasks

Face-Voice Dataset

Licenses

License

CC0

To the extent possible under law, Kranti Kumar Parida has waived all copyright and related or neighboring rights to this work.

Contributing

Please feel free to send me a pull request or an email (kranti@cse.iitk.ac.in) to add links, correct wrong ones, or report broken links.
