Matan Gover (matangover)

matangover

Geek Repo

Location:Montreal, Canada

Home Page:https://www.matangover.com

Github PK Tool:Github PK Tool


Organizations
DDMAL

Matan Gover's starred repositories

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:21408Issues:161Issues:1527

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language:PythonLicense:MITStargazers:16787Issues:153Issues:1195

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4319Issues:58Issues:138

beartype

Unbearably fast near-real-time hybrid runtime-static type-checking in pure Python.

Language:PythonLicense:MITStargazers:2534Issues:17Issues:318

so-vits-svc-5.0

Core Engine of Singing Voice Conversion & Singing Voice Clone

Language:PythonLicense:MITStargazers:2491Issues:29Issues:159

stable-audio-tools

Generative models for conditional audio generation

Language:PythonLicense:MITStargazers:2351Issues:40Issues:75

DDSP-SVC

Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)

Language:PythonLicense:MITStargazers:1742Issues:20Issues:59

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonLicense:Apache-2.0Stargazers:915Issues:26Issues:37

cam2ip

Turn any webcam into an IP camera

Language:GoLicense:GPL-3.0Stargazers:855Issues:33Issues:45

CLAP

Learning audio concepts from natural language supervision

Language:PythonLicense:MITStargazers:434Issues:14Issues:18

Play

Free and open source singing game with song editor for desktop, mobile, and smart TV

Language:C#License:MITStargazers:376Issues:26Issues:248

ltu

Code, Dataset, and Pretrained Models for Audio and Speech Large Language Model "Listen, Think, and Understand".

dasp-pytorch

Differentiable audio signal processors in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:215Issues:10Issues:5

nendo

The Nendo AI Audio Tool Suite

Language:PythonLicense:MITStargazers:202Issues:7Issues:7
Language:PythonLicense:Apache-2.0Stargazers:189Issues:4Issues:6

Multilingual-PR

Phoneme Recognition using pre-trained models Wav2vec2, HuBERT and WavLM. Throughout this project, we compared specifically three different self-supervised models, Wav2vec (2019, 2020), HuBERT (2021) and WavLM (2022) pretrained on a corpus of English speech that we will use in various ways to perform phoneme recognition for different languages with a network trained with Connectionist Temporal Classification (CTC) algorithm.

spleeterpp

A C++ Inference library for the Spleeter project

Language:C++License:MITStargazers:157Issues:10Issues:37

SpleeterRT

Real time monaural source separation base on fully convolutional neural network operates on Time-frequency domain.

Language:CLicense:GPL-3.0Stargazers:153Issues:15Issues:11
Language:PythonLicense:NOASSERTIONStargazers:150Issues:3Issues:2

vocalsound

Dataset and baseline code for the VocalSound dataset (ICASSP2022).

Language:Jupyter NotebookStargazers:95Issues:2Issues:6
Language:PythonLicense:MITStargazers:86Issues:5Issues:5

MossFormer

This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.

singing_transcription_ICASSP2021

The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"

BPE-Symbolic-Music

Code of the paper "Byte Pair Encoding for Symbolic Music" (EMNLP 2023). Better and faster music generation

MAEST

Pre-training, fine-tuning, and inference code with the MAEST models for music analysis applications.

Language:PythonLicense:AGPL-3.0Stargazers:35Issues:4Issues:2

wav-aec

Applying webrtc's acoustic echo cancellation (AEC) to audio files

Language:C++Stargazers:34Issues:3Issues:0

rt-vamp-plugin-sdk

Real-time Vamp plugin SDK for C++20

Language:C++License:MITStargazers:10Issues:2Issues:1