Beast code in Giters

A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).

Language: Python | License: MIT | 0 / 1 / 0

DeepXi

Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.

Language: MATLAB | License: MPL-2.0 | 0 / 0 / 0

demucs

Code for the paper Music Source Separation in the Waveform Domain

Language: Python | License: NOASSERTION | 0 / 1 / 0

DNN-Phase-Reconstruction

Language: Jupyter Notebook | 0 / 1 / 0

dnn_wpe

Language: Python | License: NOASSERTION | 0 / 1 / 0

dual-path-RNNs-DPRNNs-based-speech-separation

A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".

Language: Python | 0 / 1 / 0

ERNN-for-speech-enhancement

Language: Python | License: MIT | 0 / 1 / 0

FaSNet-TAC-PyTorch

Full implementation of "End-to-end microphone permutation and number invariant multi-channel speech separation" (Interspeech 2020)

Language: Python | 0 / 0 / 0

knowledge-distillation-pytorch

A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility

Language: Python | License: MIT | 0 / 1 / 0

Lip_Reading_in_the_Wild_AVSR

Audio-Visual Speech Recognition using Deep Learning

Language: Python | 0 / 1 / 0

mediaio

Language: Python | 0 / 1 / 0

onssen

An open-source speech separation and enhancement library

Language: Python | License: GPL-3.0 | 0 / 1 / 0

open-unmix-pytorch

Open-Unmix - Music Source Separation for PyTorch

Language: Python | License: MIT | 0 / 1 / 0

pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Language: Python | License: MIT | 0 / 1 / 0

python-pesq

A Python package for calculating PESQ.

Language: Python | License: MIT | 0 / 0 / 0

pytorch

0 / 0 / 0

SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Language: Python | License: MIT | 0 / 1 / 0

Sound_Localization_Algorithms

Classical algorithms of sound source localization with beamforming, TDOA and high-resolution spectral estimation.

0 / 0 / 0

speech-dereverberation

speech-dereverberation-using-GANs

Language: Python | 0 / 1 / 0

Speech-Separation-Paper

A must-read paper for speech separation based on neural networks

0 / 1 / 0

speech_feature_extractor

Some useful features for speech processing, such as MFCC, gammatone filterbank, GFCC, spectrum (power spectrum and log-power spectrum), Amplitude Modulation Spectrum (AMS), and so on.

Language: Python | 0 / 1 / 0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language: Python | 0 / 1 / 0

spleeter

Deezer source separation library including pretrained models.

Language: Python | License: MIT | 0 / 1 / 0

wavenet

Keras WaveNet implementation

Language: Python | 0 / 1 / 0