chenht2010's repositories

Language: Python | License: Apache-2.0 | Stargazers: 29 | Issues: 0 | Issues: 0

textlesslib

Library for Textless Spoken Language Processing

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

AlignSTS

Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

awesome-whisper

🔊 Awesome list for Whisper — an open-source AI-powered speech recognition system developed by OpenAI

License: CC0-1.0 | Stargazers: 0 | Issues: 0 | Issues: 0

bytecover

Implementation of the paper "ByteCover: Cover Song Identification via Multi-Loss Training" (ICASSP 2021)

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Music-Source-Separation-Training

Repository for training models for music source separation.

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

MVSEP-MDX23-music-separation-model

Model for the MDX23 music separation contest

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

naturalspeech2-pytorch

Implementation of NaturalSpeech 2, a zero-shot speech and singing synthesizer, in PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

NeMo

NeMo: a toolkit for conversational AI

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google DeepMind, in PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Language: Shell | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Speech-Separation-Paper-Tutorial

A must-read paper list for speech separation based on neural networks

Stargazers: 0 | Issues: 0 | Issues: 0

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

vall-e

PyTorch implementation of VALL-E (zero-shot text-to-speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

vocalsound

Dataset and baseline code for the VocalSound dataset (ICASSP 2022).

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

voicebox-pytorch

Implementation of Voicebox, a new SOTA text-to-speech network from Meta AI, in PyTorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0