Tingle Li's starred repositories

VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

Language: Python | License: NOASSERTION | Stargazers: 219 | Issues: 0

lhotse

Tools for handling speech data in machine learning projects.

Language: Python | License: Apache-2.0 | Stargazers: 936 | Issues: 0

CPC_audio

An implementation of the Contrastive Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Language: Python | License: MIT | Stargazers: 347 | Issues: 0

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language: Python | License: MIT | Stargazers: 1827 | Issues: 0

barlowtwins

PyTorch implementation of Barlow Twins.

Language: Python | License: MIT | Stargazers: 961 | Issues: 0

pywsj0-mix

wsj0-{2, 3, 4, 5} mix generation scripts, in Python.

Language: Python | License: MIT | Stargazers: 48 | Issues: 0

s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 2224 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8673 | Issues: 0

nnAudio

Audio processing using PyTorch 1D convolution networks

Language: Python | License: MIT | Stargazers: 1013 | Issues: 0

svoice

We provide a PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", in which we present a new method for separating a mixed audio sequence in which multiple voices speak simultaneously. The new method employs gated neural networks that are trained to separate the voices at multiple processing steps while keeping the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.

Language: Python | License: NOASSERTION | Stargazers: 1234 | Issues: 0

CVC

CVC: Contrastive Learning for Non-parallel Voice Conversion (INTERSPEECH 2021, in PyTorch)

Language: Python | License: MIT | Stargazers: 57 | Issues: 0

DawDreamer

Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors

Language: C++ | License: GPL-3.0 | Stargazers: 905 | Issues: 0

pytorch-template

PyTorch deep learning projects made easy.

Language: Python | License: MIT | Stargazers: 4715 | Issues: 0

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features, an approach that enables more efficient separation of sources from mixtures.

Language: Jupyter Notebook | License: MIT | Stargazers: 305 | Issues: 0

LibriMix

An open source dataset for source separation

Language: Python | License: MIT | Stargazers: 366 | Issues: 0
Language: Python | License: MIT | Stargazers: 54 | Issues: 0

WavAugment

A library for speech data augmentation in the time domain

Language: Python | License: MIT | Stargazers: 635 | Issues: 0

awesome-audio-visual

A curated list of different papers and datasets in various areas of audio-visual processing

Stargazers: 660 | Issues: 0

Tutorial_Separation

This repo summarizes the tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.

Language: MATLAB | Stargazers: 437 | Issues: 0

onssen

An open-source speech separation and enhancement library

Language: Python | License: GPL-3.0 | Stargazers: 211 | Issues: 0

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language: Python | License: MIT | Stargazers: 2237 | Issues: 0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License: MIT | Stargazers: 5942 | Issues: 0

NGCF-PyTorch

PyTorch Implementation for Neural Graph Collaborative Filtering

Language: Python | Stargazers: 280 | Issues: 0

Speech-Separation-Paper-Tutorial

Must-read papers for speech separation based on neural networks

Stargazers: 744 | Issues: 0

pydiogment

Python library for audio augmentation

Language: Python | License: BSD-3-Clause | Stargazers: 83 | Issues: 0

meta-tasnet

A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation"

Language: Python | License: MIT | Stargazers: 136 | Issues: 0

speechmetrics

A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR

Language: Python | License: MIT | Stargazers: 894 | Issues: 0

dual-path-RNNs-DPRNNs-based-speech-separation

A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".

Language: Python | Stargazers: 167 | Issues: 0

Wave-U-Net-Pytorch

Improved Wave-U-Net implemented in PyTorch

Language: Python | License: MIT | Stargazers: 303 | Issues: 0

spleeter

Deezer source separation library including pretrained models.

Language: Python | License: MIT | Stargazers: 25711 | Issues: 0