There are 114 repositories under voice-conversion topic.
SoftVC VITS Singing Voice Conversion
Easily train a good VC model with voice data <= 10 mins!
so-vits-svc fork with realtime support, improved interface and more features.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
This is now the official location of the Merlin project.
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
Unsupervised Speech Decomposition Via Triple Information Bottleneck
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
The code for the bark-voicecloning model. Training and inference.
Voice Conversion Tool Kit
Deep learning for audio processing
Voice Converter Using CycleGAN and Non-Parallel Data
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
基于javaFX的简单字幕处理桌面程序,集成在线翻译及语音转换
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
Audio style transfer with shallow random parameters CNN.
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).
The dataset of Speech Recognition
PPG-Based Voice Conversion