zhongshijun

followers

following

stars

happlydata's repositories

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

000

Tacotron2-PyTorch

Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.

MIT000

HGCN

The official repo of "HGCN: Harmonic Gated Compensation Network For Speech Enhancement"

000

CMGAN

Conformer-based Metric GAN for speech enhancement

MIT000

FullSubNet-plus

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

Apache-2.0000

CDEC

Cross-Domain Echo Controller

MIT000

AEC3

AEC3 Extracted From WebRTC

000

L-SpEx

000

MusicYOLO

MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.

Apache-2.0000

DeepFilterNet

Noise supression using deep filtering

NOASSERTION000

DB-AIAT

A dual-branch attention-in-attention transformer (dubbed DB-AIAT) to focus on both coarse and fine-grained regions of spectrum in parallel, i.e., spectral magnitude and lost complex spectral details. The source code will be released soon

000

pyaec

simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link adaptive filters)、frequency domain adaptive filters(frequency domain adaptive filter、frequency domain kalman filter) for acoustic echo cancellation.

Apache-2.0000

speechbrain

A PyTorch-based Speech Toolkit

Apache-2.0000

midi_degradation_toolkit

A toolkit for generating datasets of midi files which have been degraded to be 'un-musical'.

MIT000

wavebeat

End-to-end beat and downbeat tracking in the time domain.

GPL-3.0000

LibriMix

An open source dataset for source separation

MIT000

LibtorchTutorials

This is a code repository for pytorch c++ (or libtorch) tutorial.

Apache-2.0000

voice-activity-detection

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

MIT000

kissfft

a Fast Fourier Transform (FFT) library that tries to Keep it Simple, Stupid

NOASSERTION000

AttentiveTraining

000

autodsp

Train custom adaptive filter optimizers without hand tuning or extra labels.

NOASSERTION000

ai-research-code

Apache-2.0000

Complex_PF

RES via complex-valued DNN

000

segan

Speech Enhancement Generative Adversarial Network in TensorFlow

MIT000

NRES

Neural Residual Echo Suppressor

MIT000

specmix

This is a project of Interspeech2021 paper "SpecMix : A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features"

MIT000

TSEGAN

The implement of the paper 'Time-domain Speech Enhancement with GenerativeAdversarial Learning'

000

DSDPRNN

Implementation of Dual-Stream DPRNN (paper: Nonlinear Residual Echo Suppression Based on Dual-Stream DPRNN)

000

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

000

rnnoise_16k

implementation of rnnoise_16k

BSD-3-Clause000