Tingle Li's starred repositories
VisualVoice
Audio-Visual Speech Separation with Cross-Modal Consistency
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
barlowtwins
PyTorch implementation of Barlow Twins.
pywsj0-mix
wsj0-{2, 3, 4, 5} mix generation scripts, in Python.
speechbrain
A PyTorch-based Speech Toolkit
svoice
We provide a PyTorch implementation of the paper "Voice Separation with an Unknown Number of Multiple Speakers", in which we present a new method for separating a mixed audio sequence where multiple voices speak simultaneously. The method employs gated neural networks that are trained to separate the voices at multiple processing steps while keeping the speaker in each output channel fixed. A different model is trained for every number of possible speakers, and the model with the largest number of speakers is employed to select the actual number of speakers in a given sample. Our method greatly outperforms the current state of the art, which, as we show, is not competitive for more than two speakers.
DawDreamer
Digital Audio Workstation with Python; VST instruments/effects, parameter automation, FAUST, JAX, Warp Markers, and JUCE processors
pytorch-template
PyTorch deep learning projects made easy.
sudo_rm_rf
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-Resolution Features, which enables a more efficient way of separating sources from mixtures.
WavAugment
A library for speech data augmentation in the time domain
awesome-audio-visual
A curated list of different papers and datasets in various areas of audio-visual processing
Tutorial_Separation
This repo summarizes tutorials, datasets, papers, code, and tools for the speech separation and speaker extraction tasks. Pull requests are welcome.
awesome-multimodal-ml
Reading list for research topics in multimodal machine learning
NGCF-PyTorch
PyTorch Implementation for Neural Graph Collaborative Filtering
Speech-Separation-Paper-Tutorial
Must-read papers on speech separation based on neural networks
pydiogment
Python library for audio augmentation
meta-tasnet
A PyTorch implementation of Meta-TasNet from "Meta-learning Extractors for Music Source Separation"
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
dual-path-RNNs-DPRNNs-based-speech-separation
A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation".
Wave-U-Net-Pytorch
Improved Wave-U-Net implemented in PyTorch