Fernando López Gavilánez's repositories

iterative-pseudo-forced-alignment-ctc

The code for the https://arxiv.org/pdf/2210.15226.pdf

Language:PythonLicense:MITStargazers:2Issues:2Issues:1

ctc-loss

A PyTorch implementation of CTCLoss (for learning purposes)

Language:PythonStargazers:1Issues:2Issues:0

Audiomer-PyTorch

A Convolutional Transformer for Keyword Spotting (AAAI 2022 DSTC Workshop)

Language:PythonStargazers:0Issues:0Issues:0

crnn-audio-classification

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

ctc-segmentation

Segment an audio file and obtain utterance alignments. (Python package)

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

degan

Deep Effect Generation using GANs

Language:PythonStargazers:0Issues:2Issues:1

denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:1Issues:0

quark

Efficient Keyword Spotting

Language:PythonStargazers:0Issues:1Issues:0

sonopytorch

Torch implementation of Sonopy

Language:PythonStargazers:0Issues:2Issues:0

speechbrain

A PyTorch-based Speech Toolkit

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

transformer-corrector

Transformer-based Spanish corrector

Language:PythonLicense:MITStargazers:0Issues:1Issues:0

DESED_task

Domestic environment sound event detection task

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

diart

Lightweight python library for streaming speaker diarization in real-time implemented in pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

EfficientAT

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

License:MITStargazers:0Issues:0Issues:0

examples

TensorFlow examples

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:1Issues:0

ferugit.github.io

W. Fernando López Gavilánez public page

Language:HTMLStargazers:0Issues:2Issues:0

performer-pytorch

An implementation of Performer, a linear attention-based transformer, in Pytorch

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

pytorch_introduction

Several pytorch projects

Language:Jupyter NotebookStargazers:0Issues:2Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

speaker-recognition-exploration

Speaker Recognition Exploration

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

ssast

Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".

Language:PythonStargazers:0Issues:0Issues:0

transducer-tutorial

Example code for a neural transducer model.

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

udacity-deep-learning

Udacity Deep Learning Course

Language:Jupyter NotebookLicense:MITStargazers:0Issues:2Issues:0

wuw-challenge-2024

Baseline of the Wake-up Word Challenge of the 2024 Albayzin Evaluations

Language:PythonStargazers:0Issues:0Issues:0