Fernando López Gavilánez's repositories
iterative-pseudo-forced-alignment-ctc
Code for the paper at https://arxiv.org/pdf/2210.15226.pdf
Audiomer-PyTorch
A Convolutional Transformer for Keyword Spotting (AAAI 2022 DSTC Workshop)
crnn-audio-classification
UrbanSound classification using Convolutional Recurrent Networks in PyTorch
ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain, in which we present a causal speech enhancement model that works on the raw waveform and runs in real time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip connections. It is optimized in both the time and frequency domains using multiple loss functions. Empirical evidence shows that it can remove various kinds of background noise, both stationary and non-stationary, as well as room reverb. Additionally, we suggest a set of data augmentation techniques, applied directly to the raw waveform, that further improve model performance and generalization.
sonopytorch
Torch implementation of Sonopy
speechbrain
A PyTorch-based Speech Toolkit
transformer-corrector
Transformer-based Spanish corrector
DESED_task
Domestic environment sound event detection task
diart
Lightweight Python library for real-time streaming speaker diarization, implemented in PyTorch
EfficientAT
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
ferugit.github.io
W. Fernando López Gavilánez's public page
performer-pytorch
An implementation of Performer, a linear attention-based transformer, in PyTorch
pytorch_introduction
Several PyTorch projects
speaker-recognition-exploration
Speaker Recognition Exploration
ssast
Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".
transducer-tutorial
Example code for a neural transducer model.
udacity-deep-learning
Udacity Deep Learning Course
wuw-challenge-2024
Baseline of the Wake-up Word Challenge of the 2024 Albayzin Evaluations