About the audio-text pair of AudioSet dataset.
blue-blue272 opened this issue · comments
blue-blue272 commented
AudioSet only contains audio and event labels. How do you obtain the caption description for audios in the audioset dataset?
Yuan Gong commented
Please check this: https://github.com/XinhaoMei/WavCaps. It is in the paper, but probably not very obvious place.
-Yuan