voice-conversion

There are 132 repositories under voice-conversion topic.

coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
python text-to-speech deep-learning speech pytorch tts vocoder tacotron glow-tts melgan speaker-encoder hifigan speaker-encodings multi-speaker-tts tts-model speech-synthesis voice-cloning voice-synthesis voice-conversion
Language:Python 42605
RVC-Project / Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
change sovits vits voice voice-conversion rvc audio-analysis conversational-ai conversion converter retrieval-model retrieve-data so-vits-svc vc voice-converter voiceconversion
Language:Python 32000
svc-develop-team / so-vits-svc
SoftVC VITS Singing Voice Conversion
ai audio-analysis generative-adversarial-network singing-voice-conversion so-vits-svc sovits variational-inference vc vits voice voice-conversion voiceconversion voice-changer flow deep-learning pytorch speech
Language:Python 27613
espnet / espnet
End-to-End Speech Processing Toolkit
deep-learning end-to-end chainer pytorch kaldi speech-recognition speech-synthesis speech-translation machine-translation voice-conversion speech-enhancement speech-separation singing-voice-synthesis speaker-diarization spoken-language-understanding text-to-speech
Language:Python 9460
Amphion
open-mmlab / Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
audio-generation audio-synthesis audioldm music-generation naturalspeech2 singing-voice-conversion speech-synthesis text-to-audio text-to-speech vall-e voice-conversion audit fastspeech2 vits emilia maskgct vocoder
Language:Python 9383
voicepaw / so-vits-svc-fork
so-vits-svc fork with realtime support, improved interface and more features.
sovits vits voice-conversion so-vits-svc hubert softvc realtime voice-changer deep-learning pytorch speech-synthesis contentvec gan lightning pytorch-lightning hacktoberfest
Language:Python 9119
voice-pro
abus-aikorea / voice-pro
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
faster-whisper tts whisper gradio subtitles transcription translator webui speech-recognition speech-synthesis speech-to-text text-to-speech yt-dlp voice-cloning podcasts audiobook voice-conversion karaoke whisperx
Language:Python 4808
zzw922cn / awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
automatic-speech-recognition papers roadmap rnn cnn dnn attention-mechanism seq2seq acoustic-model timit-dataset tts language-model speaker-verification speech-recognition speech-synthesis neural-network recognition-synthesis diffusion-models singing-voice-synthesis voice-conversion
3068
Applio
IAHispano / Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
rvc vc vits voice ai voice-cloning voice-conversion applio voice-clone pytorch speech speech-to-speech text-to-speech tts
Language:Python 2585
Plachtaa / seed-vc
zero-shot voice conversion & singing voice conversion, with real-time support
singing-voice-conversion voice-conversion
Language:Python 2180
jim-schwoebel / voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
voice-dataset voice-datasets audio-dataset audio-datasets datasets dataset voice data voice-computing voice-control voice-synthesis voice-commands voice-assistant voice-recognition voice-chat voice-activity-detection voice-conversion noise
2014
CSTR-Edinburgh / merlin
This is now the official location of the Merlin project.
merlin speech-synthesis text-to-speech voice-conversion deep-learning python theano tensorflow keras
Language:Python 1318
auspicious3000 / autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
voice-conversion speech-synthesis generative-models tacotron-pytorch wavenet-vocoder unsupervised-learning
Language:Python 1050
Edresson / YourTTS
YourTTS: Towards Zero-Shot Multi-Speaker TTS and Zero-Shot Voice Conversion for everyone
voice-conversion zero-shot-voice-conversion zero-shot-multi-speaker-tts tts speech-synthesis
Language:Jupyter Notebook 959
Spr-Aachen / Easy-Voice-Toolkit
一个简易的AI语音工具箱 | A user-friendly audio toolkit for voice recognition, voice transcription, voice conversion etc.
audio-denoising voice-conversion voice-recognition voice-transcription software
Language:Python 839
Tiger14n / RVC-GUI
Just a fork of RVC for easy audio file voice conversion locally
rvc so-vits-svc sovits voice-changer voice-conversion
Language:Python 801
gabrielmittag / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
speech-quality deep-learning interspeech icassp tts pytorch voice-conversion text-to-speech speech-synthesis quality-of-experience
Language:Python 759
gitmylo / bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
ai neural-networks text-to-speech voice-cloning voice-conversion
Language:Python 694
markovka17 / dla
Deep learning for audio processing
deep-learning speech-recognition tts signal-processing voice-conversion keyword-spotting speaker-verification
Language:Jupyter Notebook 685
auspicious3000 / SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
voice-conversion unsupervised-learning disentangled-representations
Language:Python 671
OlaWod / FreeVC
FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion
pytorch speech voice-conversion
Language:Python 660
PlayVoice / lora-svc
singing voice change based on whisper, and lora for singing voice clone
singing-voice-conversion voice-conversion voice-change vits voice-cloning speech-to-sing uni-svc whisper vits-svc lora
Language:Python 635
k2kobayashi / sprocket
Voice Conversion Tool Kit
voice-conversion speech-synthesis sprockets speech-enhancement
Language:Python 600
jackaduma / CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
voice-conversion cyclegan-vc2 cyclegan gan deeplearning voice-cloning pytorch-implementation cyclegan-vc speech-synthesis deep-learning pix2pix aigc
Language:Python 560
daniilrobnikov / vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
deep-learning pytorch speech speech-synthesis text-to-speech tts vits2 voice-conversion
Language:Jupyter Notebook 552
leimao / Voice-Converter-CycleGAN
Voice Converter Using CycleGAN and Non-Parallel Data
voice-conversion cyclegan speech
Language:Python 529
liusongxiang / StarGAN-Voice-Conversion
This is a pytorch implementation of the paper: StarGAN-VC: Non-parallel many-to-many voice conversion with star generative adversarial networks
voice-conversion stargan pytorch-implementation
Language:Python 524
r9y9 / gantts
PyTorch implementation of GAN-based text-to-speech synthesis and voice conversion (VC)
speech-synthesis voice-conversion generative-adversarial-net gan nnmnkwii
Language:Jupyter Notebook 518
yl4579 / StarGANv2-VC
StarGANv2-VC: A Diverse, Unsupervised, Non-parallel Framework for Natural-Sounding Voice Conversion
voice-conversion speech-synthesis gan deep-learning speech stargan-v2 interspeech2021
Language:Python 509
ddPn08 / rvc-webui
liujing04/Retrieval-based-Voice-Conversion-WebUI reconstruction project
tts vits voice-conversion rvc
Language:Python 497
fordes123 / subtitles-view
基于javaFX的简单字幕处理桌面程序，集成在线翻译及语音转换
javafx javafx-desktop-apps javafx-gui mybatis-plus springboot sqlite3 subtitles-generator subtitles-search voice-conversion
Language:Java 487
bshall / knn-vc
Voice Conversion With Just Nearest Neighbors
any-to-any knn pytorch self-supervised-learning speech speech-synthesis voice-conversion
Language:Python 482
bshall / soft-vc
Soft speech units for voice conversion
voice-conversion speech-synthesis self-supervised-learning
Language:Jupyter Notebook 426
double22a / speech_dataset
The dataset of Speech Recognition
asr speech-recognition deep-learning dataset audio deep-neural-networks wav speech-to-text speech tts speech-synthesis voice-conversion speech-translation speech-enhancement speech-diarization speech-separation speech-segmentation text-to-speech automatic-speech-recognition
424
guan-yuan / Awesome-Singing-Voice-Synthesis-and-Singing-Voice-Conversion
A paper and project list about the cutting edge Speech Synthesis, Text-to-Speech (TTS), Singing Voice Synthesis (SVS), Voice Conversion (VC), Singing Voice Conversion (SVC), and related interesting works (such as Music Synthesis, Automatic Music Transcription, Automatic MOS Prediction, SSL-based ASR...etc).
pytorch singing-synthesis singing-voice singing-voice-conversion singing-voice-synthesis speech speech-synthesis tts voice-conversion music music-generation music-synthesis automatic-music-transcription diffusion-models mos-prediction music-transcription text-to-speech
422
mazzzystar / randomCNN-voice-transfer
Audio style transfer with shallow random parameters CNN.
style-transfer voice-transfer voice-conversion speech-conversion
Language:Python 404