GenjiB's repositories

LAVISH

Vision Transformers are Parameter-Efficient Audio-Visual Learners

Language: Python | License: MIT | Stargazers: 31 | Issues: 1 | Issues: 6

AVSiam

Siamese Vision Transformers are Scalable Audio-visual Learners

Language: Python | Stargazers: 5 | Issues: 0 | Issues: 0

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Stargazers: 1 | Issues: 0 | Issues: 0

ast

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0
Language: C++ | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0