Beast code in Giters

Matan Gover's starred repositories

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonMIT21408 161 1527

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonMIT16787 153 1195

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT4319 58 138

beartype

Unbearably fast near-real-time hybrid runtime-static type-checking in pure Python.

Language:PythonMIT2534 17 318

so-vits-svc-5.0

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language:PythonMIT2491 29 159

stable-audio-tools

Generative models for conditional audio generation

Language:PythonMIT2351 40 75

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Language:PythonMIT1742 20 59

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonApache-2.0915 26 37

cam2ip

Turn any webcam into an IP camera

Language:GoGPL-3.0855 33 45

CLAP

Learning audio concepts from natural language supervision

Language:PythonMIT434 14 18

Play

Free and open source singing game with song editor for desktop, mobile, and smart TV

Language:C#MIT376 26 248

ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

Language:Python340 14 43

dasp-pytorch

Differentiable audio signal processors in PyTorch

Language:PythonApache-2.0215 10 5

nendo

The Nendo AI Audio Tool Suite

Language:PythonMIT202 7 7

RMVPE

Language:PythonApache-2.0189 4 6

Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.

Language:Python188 4 5

spleeterpp

A C++ Inference library for the Spleeter project

Language:C++MIT157 10 37

SpleeterRT

Real time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.

Language:CGPL-3.0153 15 11

musicfm

Language:PythonNOASSERTION150 3 2

vocalsound

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Language:Jupyter Notebook95 2 6

FCPE

Language:PythonMIT86 5 5

MossFormer

This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.

Apache-2.075 2 4

matangover