Tinglok

We provide a PyTorch implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers In which, we present a new method for separating a mixed audio sequence, in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps, while maintaining the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.

Language:PythonNOASSERTION123400

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)

Language:PythonMIT5700

DawDreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

Language:C++GPL-3.090500

pytorch-template

PyTorch deep learning projects made easy.

Language:PythonMIT471500

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features which enables a more efficient way of separating sources from mixtures.

Language:Jupyter NotebookMIT30500

LibriMix

An open source dataset for source separation

Language:PythonMIT36600

SparseLibriMix

Language:PythonMIT5400

WavAugment

A library for speech data augmentation in time-domain

Language:PythonMIT63500

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

66000

Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly invited to pull requests.

Language:MATLAB43700

onssen

An open-source speech separation and enhancement library

Language:PythonGPL-3.021100

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonMIT223700

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

MIT594200

NGCF-PyTorch

PyTorch Implementation for Neural Graph Collaborative Filtering

Language:Python28000

Speech-Separation-Paper-Tutorial

A must-read paper for speech separation based on neural networks

74400

pydiogment

:mega: Python library for audio augmentation

Language:PythonBSD-3-Clause8300

meta-tasnet

A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation

Language:PythonMIT13600

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language:PythonMIT89400

dual-path-RNNs-DPRNNs-based-speech-separation

A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".

Language:Python16700

Wave-U-Net-Pytorch

Improved Wave-U-Net implemented in Pytorch

Language:PythonMIT30300

spleeter

Deezer source separation library including pretrained models.

Language:PythonMIT2571100