VGGish: A VGG-like audio classification model
This repository provides a VGGish model implemented in Keras with the TensorFlow backend. It is based on the VGGish model released for AudioSet. For more details, see the original TensorFlow-Slim version.
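For orientation, the VGGish architecture can be sketched in Keras roughly as follows. This is a minimal sketch and not necessarily this repository's code: the function name `build_vggish_sketch` is hypothetical, and the 96x64 log-mel input patch, the layer sizes, and the 128-dimensional embedding are assumptions taken from the published description of the AudioSet model.

```python
from tensorflow.keras import layers, models


def build_vggish_sketch(input_shape=(96, 64, 1), embedding_size=128):
    """Build a VGG-like network for log-mel spectrogram patches.

    Assumed (not confirmed by this repository): 96 frames x 64 mel bands
    as input and a 128-dimensional embedding as output.
    """
    inputs = layers.Input(shape=input_shape)
    x = inputs
    # Each tuple is (number of filters, number of conv layers in the block).
    for filters, n_convs in [(64, 1), (128, 1), (256, 2), (512, 2)]:
        for _ in range(n_convs):
            x = layers.Conv2D(filters, (3, 3), padding="same",
                              activation="relu")(x)
        # Halve both the time and frequency dimensions after each block.
        x = layers.MaxPooling2D((2, 2), strides=(2, 2), padding="same")(x)
    x = layers.Flatten()(x)
    x = layers.Dense(4096, activation="relu")(x)
    x = layers.Dense(4096, activation="relu")(x)
    outputs = layers.Dense(embedding_size, activation="relu")(x)
    return models.Model(inputs, outputs, name="vggish_sketch")


model = build_vggish_sketch()
```

Calling `model.summary()` prints the layer stack. Layer names in this sketch are auto-generated, so pretrained weights may not load into it by name without adjustment.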
Pretrained weights in Keras HDF5 (.h5) format:
References:
- Gemmeke, J. et al., Audio Set: An ontology and human-labeled dataset for audio events, ICASSP 2017
- Hershey, S. et al., CNN Architectures for Large-Scale Audio Classification, ICASSP 2017