Audio Event Detection and Localization with Multitask Regression Network

Source code for the submission "Audio Event Detection and Localization with Multitask Regression Network", which ranked 6th in the DCASE 2020 SELD challenge.

How to run:

Extract features: bash run_feature_extraction.sh
Run experiments on the development data: bash run_dcase_dev.sh
Run experiments on the evaluation data: bash run_dcase_eval.sh
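
The three steps above can also be chained from Python. This is only a minimal sketch, not part of the repository: the helper name run_pipeline and its dry_run flag are hypothetical, and it assumes the scripts are launched from the repository root in the order listed and that each step should stop the chain if it fails.

```python
import subprocess

# Script names are taken from the steps above; running them in this order
# from the repository root is an assumption.
PIPELINE = [
    "run_feature_extraction.sh",
    "run_dcase_dev.sh",
    "run_dcase_eval.sh",
]


def run_pipeline(scripts=PIPELINE, dry_run=False):
    """Run each script with bash, stopping at the first failure.

    With dry_run=True the commands are only printed, not executed.
    Returns the list of commands that were (or would be) run.
    """
    commands = [["bash", script] for script in scripts]
    for cmd in commands:
        print("running:", " ".join(cmd))
        if not dry_run:
            # check=True raises CalledProcessError on a non-zero exit code.
            subprocess.run(cmd, check=True)
    return commands
```

Calling run_pipeline(dry_run=True) first is a cheap way to confirm the command list before starting a long feature extraction or training run.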

Environment:

Python 3
librosa
TensorFlow GPU 1.x (x >= 9), for network training and evaluation
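
Before training, it may help to confirm that the installed TensorFlow matches the 1.x (x >= 9) requirement. The sketch below is an illustration only: the helper is_supported_tf is hypothetical and simply parses a version string, so it can be tried without TensorFlow installed.

```python
def is_supported_tf(version):
    """Return True if a version string denotes TensorFlow 1.x with x >= 9."""
    parts = version.split(".")
    major, minor = int(parts[0]), int(parts[1])
    return major == 1 and minor >= 9


# Typical use, assuming TensorFlow is installed:
#   import tensorflow as tf
#   if not is_supported_tf(tf.__version__):
#       raise RuntimeError("TensorFlow 1.x (x >= 9) is required")
```

The check looks only at the major and minor components, so patch releases and suffixes such as "1.15.0-rc1" are accepted.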

Contact:

Huy Phan

Centre for Digital Music (C4DM)
Queen Mary University of London
Email: h.phan@qmul.ac.uk

License

MIT © Huy Phan
