happlydata's repositories

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

Stargazers:0Issues:0Issues:0

Tacotron2-PyTorch

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

License:MITStargazers:0Issues:0Issues:0

HGCN

The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"

Stargazers:0Issues:0Issues:0

CMGAN

Conformer-based Metric GAN for speech enhancement

License:MITStargazers:0Issues:0Issues:0

FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

License:Apache-2.0Stargazers:0Issues:0Issues:0

CDEC

Cross-Domain Echo Controller

License:MITStargazers:0Issues:0Issues:0

AEC3

AEC3 Extracted From WebRTC

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

MusicYOLO

MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.

License:Apache-2.0Stargazers:0Issues:0Issues:0

DeepFilterNet

Noise supression using deep filtering

License:NOASSERTIONStargazers:0Issues:0Issues:0

DB-AIAT

A dual-branch attention-in-attention transformer (dubbed DB-AIAT) to focus on both coarse and fine-grained regions of spectrum in parallel, i.e., spectral magnitude and lost complex spectral details. The source code will be released soon

Stargazers:0Issues:0Issues:0

pyaec

simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link adaptive filters)、frequency domain adaptive filters(frequency domain adaptive filter、frequency domain kalman filter) for acoustic echo cancellation.

License:Apache-2.0Stargazers:0Issues:0Issues:0

speechbrain

A PyTorch-based Speech Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0

midi_degradation_toolkit

A toolkit for generating datasets of midi files which have been degraded to be 'un-musical'.

License:MITStargazers:0Issues:0Issues:0

wavebeat

End-to-end beat and downbeat tracking in the time domain.

License:GPL-3.0Stargazers:0Issues:0Issues:0

LibriMix

An open source dataset for source separation

License:MITStargazers:0Issues:0Issues:0

LibtorchTutorials

This is a code repository for pytorch c++ (or libtorch) tutorial.

License:Apache-2.0Stargazers:0Issues:0Issues:0

voice-activity-detection

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

License:MITStargazers:0Issues:0Issues:0

kissfft

a Fast Fourier Transform (FFT) library that tries to Keep it Simple, Stupid

License:NOASSERTIONStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

autodsp

Train custom adaptive filter optimizers without hand tuning or extra labels.

License:NOASSERTIONStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

Complex_PF

RES via complex-valued DNN

Stargazers:0Issues:0Issues:0

segan

Speech Enhancement Generative Adversarial Network in TensorFlow

License:MITStargazers:0Issues:0Issues:0

NRES

Neural Residual Echo Suppressor

License:MITStargazers:0Issues:0Issues:0

specmix

This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features"

License:MITStargazers:0Issues:0Issues:0

TSEGAN

The implement of the paper 'Time-domain Speech Enhancement with GenerativeAdversarial Learning'

Stargazers:0Issues:0Issues:0

DSDPRNN

Implementation of Dual-Stream DPRNN (paper: Nonlinear Residual Echo Suppression Based on Dual-Stream DPRNN)

Stargazers:0Issues:0Issues:0

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

Stargazers:0Issues:0Issues:0

rnnoise_16k

implementation of rnnoise_16k

License:BSD-3-ClauseStargazers:0Issues:0Issues:0