speech-separation

There are 56 repositories under the speech-separation topic.

speechbrain / speechbrain
A PyTorch-based Speech Toolkit
speech-recognition speech-toolkit speaker-recognition speech-to-text speech-enhancement speech-separation audio audio-processing speech-processing speechrecognition asr voice-recognition spoken-language-understanding speaker-diarization speaker-verification pytorch huggingface transformers language-model deep-learning
Language:Python 7964
espnet / espnet
End-to-End Speech Processing Toolkit
deep-learning end-to-end chainer pytorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization spoken-language-understanding
Language:Python 7932
asteroid-team / asteroid
The PyTorch-based audio source separation toolkit for researchers
source-separation speech-separation audio-separation speech-enhancement deep-learning pytorch pretrained-models
Language:Python 2124
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
tts stt speech-to-text text-to-speech speech-recognition speech-synthesis speech-processing voice-recognition voice-activity-detection voice-cloning speech-emotion-recognition speech-separation
1214
maum-ai / voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
source-separation audio-separation speech-separation pytorch voicefilter
Language:Python 1035
JusperLee / Speech-Separation-Paper-Tutorial
Must-read papers on neural-network-based speech separation
paper speech-enhancement speech-separation voice-separation
708
kaituoxu / Conv-TasNet
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
speech-separation source-separation audio-separation pit pytorch tasnet conv-tasnet permutation-invariant-training
Language:Python 641
Audio-WestlakeU / FullSubNet
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
speech-enhancement speech-processing speech-separation pytorch pretrained-model paper full-band sub-band single-channel noise-reduction denoising audio band narrow-band reproducible-research speech
Language:Python 508
anicolson / DeepXi
Deep Xi: A deep learning approach to a priori SNR estimation implemented in TensorFlow 2/Keras. For speech enhancement and robust ASR.
resnet tensorflow speech-enhancement robust-asr deepxi a-priori-snr-estimator mmse minimum-mean-square-error mmse-lsa residual-networks deep-xi speech-separation noise-estimation source-separation deepmmse tcn keras attention mhanet multi-head-attention
Language:MATLAB 486
gemengtju / Tutorial_Separation
This repo summarizes tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.
speech-separation speech-processing speech-analysis deep-learning deep-neural-networks signal-processing
Language:MATLAB 411
JusperLee / Dual-Path-RNN-Pytorch
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation, implemented in PyTorch
pytorch speech-separation-algorithm deep-learning rnn-model speech-separation
Language:Python 393
microsoft / UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
pytorch speech-recognition speech-processing speech diarization speech-separation speech-diarization speaker-verification
Language:Python 393
funcwj / setk
Tools for Speech Enhancement integrated with Kaldi
kaldi speech-enhancement beamforming speech speech-separation rir-generator time-frequency-masking
Language:Python 390
JusperLee / Conv-TasNet
PyTorch implementation of "Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation"
pytorch speech-separation deep-learning cnn-architecture
Language:Python 386
posenhuang / deeplearningsourceseparation
Deep Recurrent Neural Networks for Source Separation
audio-separation deep-learning matlab rnn source-separation speech-denoising speech-separation
Language:MATLAB 364
double22a / speech_dataset
The dataset of Speech Recognition
asr speech-recognition deep-learning dataset audio deep-neural-networks wav speech-to-text speech tts speech-synthesis voice-conversion speech-translation speech-enhancement speech-diarization speech-separation speech-segmentation text-to-speech automatic-speech-recognition
362
speechbrain / speechbrain.github.io
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
deep-learning speech-recognition speech-to-text speech speech-processing speaker-recognition speaker-verification speaker-identification speech-separation speech-emotion-recognition speechrecognition speech-recognizer deeplearning neural-network neural-networks beamforming timit librispeech speech-analysis speech-api
Language:HTML 358
AppleHolic / source_separation
Deep-learning-based speech source separation using PyTorch
deep-learning source-separation pytorch audio speech speech-separation
Language:Jupyter Notebook 310
seanwood / gcc-nmf
Real-time GCC-NMF Blind Speech Separation and Enhancement
speech-separation speech-enhancement gcc-nmf nmf real-time real-time-processing speech speech-processing cross-correlation generalized-cross-correlation low-latency machine-learning unsupervised-machine-learning dictionary-learning gcc tdoa ipython-notebook speaker
Language:Python 308
aishoot / LSTM_PIT_Speech_Separation
Two-talker Speech Separation with LSTM/BLSTM by Permutation Invariant Training method.
speech-separation audio-separation robust-speech-recognition permutation-invariant-training multi-speaker speech-enhancement
Language:Jupyter Notebook 302
etzinis / sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.
deep-learning speech speech-separation audio
Language:Jupyter Notebook 299
tky823 / DNN-based_source_separation
A PyTorch implementation of DNN-based source separation.
source-separation speech-separation pytorch tasnet audio-separation conv-tasnet
Language:Python 268
funcwj / conv-tasnet
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https://github.com/funcwj/aps)
tasnet speech-separation pytorch
Language:Python 204
eesungkim / Speech_Enhancement_DNN_NMF
Speech Enhancement based on DNN (Spectral-Mapping, TF-Masking), DNN-NMF, NMF
dnn-speech-enhancement dnn-tf-masking dnn-spectral-mapping dnn-nmf nmf-speech-enhancement speech-separation source-separation
Language:Python 168
meokz / looking-to-listen
Deep neural network (DNN) for noise reduction, removal of background music, and speech separation
speech-separation noise-reduction
Language:Python 162
JusperLee / Looking-to-Listen-at-the-Cocktail-Party
Runnable code based on Google's "Looking to Listen at the Cocktail Party" paper
cocktail-party audio speech-separation facenet
Language:Python 160
funcwj / aps
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
speech-recognition speech-enhancement speech-separation multi-channel kaldi speech end-to-end
Language:Python 131
KyleZhang1118 / Voice-Separation-and-Enhancement
A framework for quick testing and comparing multi-channel speech enhancement and separation methods, such as DSB, MVDR, LCMV, GEVD beamforming and ICA, FastICA, IVA, AuxIVA, OverIVA, ILRMA, FastMNMF.
multi-channel speech-enhancement speech-separation
Language:MATLAB 126
JusperLee / Deep-Clustering-for-Speech-Separation
PyTorch implementation of "Deep Clustering: Discriminative Embeddings for Segmentation and Separation"
speech-separation segmentation pytorch deep-clustering
Language:Python 118
funcwj / deep-clustering
deep clustering method for single-channel speech separation
speech-separation pytorch
Language:Python 108
kaituoxu / TasNet
A PyTorch implementation of Time-domain Audio Separation Network (TasNet) with Permutation Invariant Training (PIT) for speech separation.
speech-separation source-separation audio-separation pit pytorch tasnet permutation-invariant-training
Language:Python 103
funcwj / uPIT-for-speech-separation
Speech separation with utterance-level PIT experiments
speech-separation pytorch pit
Language:Python 101
funcwj / voice-filter
An unofficial PyTorch implementation of Google's VoiceFilter
speech-separation pytorch
Language:Python 94
JusperLee / Calculate-SNR-SDR
Script to calculate SNR and SDR in Python
sdr speech-analysis speech-separation
Language:Python 83
JusperLee / UtterancePIT-Speech-Separation
Multi-GPU training code based on funcwj's uPIT, with a rewritten Dataloader.
utterancepit-speech-separation dataloader speech-separation pytorch
Language:Python 63
cyrta / awesome-speech-enhancement
A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.
speech-enhancement noise-reduction dereverberation speech-processing awesome awesome-list deep-learning denoising speech-denoising speech-separation
58
