Matan Gover's starred repositories
Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
so-vits-svc-5.0
Core Engine of Singing Voice Conversion & Singing Voice Clone
stable-audio-tools
Generative models for conditional audio generation
dasp-pytorch
Differentiable audio signal processors in PyTorch
Multilingual-PR
Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.
spleeterpp
A C++ Inference library for the Spleeter project
SpleeterRT
Real time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.
vocalsound
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
MossFormer
This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.
singing_transcription_ICASSP2021
The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"
BPE-Symbolic-Music
Code of the paper "Byte Pair Encoding for Symbolic Music" (EMNLP 2023). Better and faster music generation
rt-vamp-plugin-sdk
Real-time Vamp plugin SDK for C++20