Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
Home Page:http://andrewowens.com/multisensory/
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool
tuffr5 opened this issue 5 years ago · comments
Hello, Is it (batch_size, channel, depth, height, width) or (b, d, h, w, c) or something else? I'm new to tensorflow and it confuses me a lot. Thanks.