Irene Martín Morató's starred repositories
ssl4birdsounds
Self-supervised representation learning for bird sounds (ICASSPW SASB 2024)
HTS-Audio-Transformer
The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"
DCASE2021_task6_v2
Code for CVSSP submission to DCASE 2021 Task 6
TextToAudioGrounding
The dataset and baseline code for Text-to-Audio Grounding (TAG)
whisper-at
Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event Taggers"
aac-datasets
Audio Captioning datasets for PyTorch.
transformer_workshop
Code for the Transformer workshop
audio-and-speech-tech-2022
Audio and Speech Technologies Workshop 2022, code examples
interpretable_predictions
Interpretable Neural Predictions with Differentiable Binary Variables
dcase_util
A collection of utilities for Detection and Classification of Acoustic Scenes and Events
pytorchforaudio
Code for the "PyTorch for Audio + Music Processing" series on The Sound of AI YouTube channel.
dcase_datalist
Collection of DCASE related datasets