voice-cloning

There are 256 repositories under voice-cloning topic.

Real-Time-Voice-Cloning
CorentinJ / Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
deep-learning python pytorch tensorflow tts voice-cloning
Language:Python 50957
coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
deep-learning glow-tts hifigan melgan multi-speaker-tts python pytorch speaker-encoder speaker-encodings speech speech-synthesis tacotron text-to-speech tts tts-model vocoder voice-cloning voice-conversion voice-synthesis
Language:Python 29828
RVC-Boss / GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
text-to-speech tts vits voice-clone voice-cloneai voice-cloning
Language:Python 24812
PaddlePaddle / PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
transformer conformer speech-translation streaming-asr speech-alignment punctuation-restoration streaming-tts speech-synthesis tts asr kws speech-recognition sound-classification voice-cloning vocoder voice-recognition self-supervised-learning wav2vec2 whisper code-switch
Language:Python 10246
BenAAndrew / Voice-Cloning-App
A Python/Pytorch app for easily synthesising human voices
deep-learning python pytorch tacotron2 text-to-speech tts voice-cloning
Language:Python 1348
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
speech-emotion-recognition speech-processing speech-recognition speech-separation speech-synthesis speech-to-text stt text-to-speech tts voice-activity-detection voice-cloning voice-recognition
1214
Applio
IAHispano / Applio
VITS-based Voice Conversion focused on simplicity, quality and performance.
ai applio pytorch rvc speech speech-to-speech text-to-speech vc vits voice voice-clone voice-cloning voice-conversion
Language:Python 1081
gitmylo / audio-webui
A webui for different audio related Neural Networks
ai aio all-in-one artificial-intelligence audiocraft audioldm bark bark-gui generative-audio generative-music music rvc rvc-gui text-to-audio text-to-speech tts voice-cloning
Language:Python 919
Tomiinek / Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
text-to-speech speech-synthesis multilingual tts code-switching voice-cloning
Language:Python 810
wunjo.wladradchenko.ru
wladradchenko / wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.
free image-animation tacotron2 talking-face talking-face-generation talking-head tts wunjo face-swap face-swapping voice-recognition voice-cloning deepfake-emotion retouching-video controlnet diffusion segment-anything vid2vid deepfake deepfakes
Language:Python 722
PaddlePaddle / Parakeet
PAddle PARAllel text-to-speech toolKIT (supporting Tacotron2, Transformer TTS, FastSpeech2/FastPitch, SpeedySpeech, WaveFlow and Parallel WaveGAN)
text-to-speech speech-synthesis tacotron2 transformer-tts waveflow speedyspeech fastspeech2 parallelwavegan multi-speaker-tts text-frontend ge2e voice-cloning fastpitch
Language:Python 599
gitmylo / bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
ai neural-networks text-to-speech voice-cloning voice-conversion
Language:Python 598
PlayVoice / lora-svc
singing voice change based on whisper, and lora for singing voice clone
singing-voice-conversion voice-conversion voice-change vits voice-cloning speech-to-sing uni-svc whisper vits-svc lora
Language:Python 588
jackaduma / CycleGAN-VC2
Voice Conversion by CycleGAN (语音克隆/语音转换): CycleGAN-VC2
aigc cyclegan cyclegan-vc cyclegan-vc2 deep-learning deeplearning gan pix2pix pytorch-implementation speech-synthesis voice-cloning voice-conversion
Language:Python 505
SforAiDl / Neural-Voice-Cloning-With-Few-Samples
This repository has implementation for "Neural Voice Cloning With Few Samples"
deep-learning mel-spectogram saidl speaker-adaptation speaker-encodings speech-processing tts voice voice-cloning voice-synthesis
Language:Python 422
Multi-Tacotron-Voice-Cloning
vlomme / Multi-Tacotron-Voice-Cloning
Phoneme multilingual(Russian-English) voice cloning based on
deep-learning pytorch tensorflow tts voice-cloning g2p tacotron wavernn russian
Language:Python 381
deterministic-algorithms-lab / Cross-Lingual-Voice-Cloning
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
text-to-speech voice-cloning multi-lingual pytorch vae
Language:Jupyter Notebook 353
Sharad24 / Neural-Voice-Cloning-with-Few-Samples
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
voice-cloning speech-synthesis speech-processing speaker-encodings encodings speech speaker-embeddings mel-spectrogram
Language:Python 252
CMsmartvoice / One-Shot-Voice-Cloning
:relaxed: One Shot Voice Cloning base on Unet-TTS
tts style-transfer one-shot voice-cloning
Language:Jupyter Notebook 233
dunky11 / voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
dataset-manager delightfultts preprocessing speech-synthesis text-to-speech toolkit tts univnet voice-cloning
Language:Python 211
pranauv1 / AI-Video-Translation
A simple Google Colab notebook which can translate an original video into multiple languages along with lip sync.
lip-sync translation voice-cloning
Language:Jupyter Notebook 187
BoltzmannEntropy / xtts2-ui
A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech
coqui-tts tts voice-cloning streamlit
Language:Python 174
WeeaBlind
FlorianEagox / WeeaBlind
A program to dub non-english media with modern AI speech synthesis, diarization, and voice cloning!
a11y accessibility anime blindness diariz dubbing python tts voice-cloning
Language:Python 171
Voice-synthesis
smoke-trees / Voice-synthesis
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to-speech model trained to generalize to new voices.
voice-synthesis voice-cloning sv2tts pytorch-implementation tensorflow keras speech-to-text
Language:Python 159
simsax / Voice_cloner
A guide to clone anyone's voice and use it as a text-to-speech with android
android text-to-speech tts voice-cloning
Language:Python 157
SayanoAI / RVC-Studio
The best looking and most functional webui for RVC related tasks. See website for UI demo:
ai ai-voice-changers rvc rvc-project voice-cloning rvc-studio
Language:Python 150
jackaduma / CycleGAN-VC3
Voice Conversion by CycleGAN (语音克隆/语音转换)：CycleGAN-VC3
aigc cyclegan cyclegan-vc cyclegan-vc2 cyclegan-vc3 gan pytorch pytorch-implementation voice-cloning voice-conversion
Language:Python 134
kanttouchthis / text_generation_webui_xtts
XTTSv2 Extension for oobabooga text-generation-webui
llm tts voice-cloning
Language:Python 129
AIFSH / ComfyUI-GPT_SoVITS
a comfyui custom node for GPT-SoVITS! you can voice cloning and tts in comfyui now
gpt-sovits tts voice-cloning
Language:Python 122
sidharthrajaram / StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
deep-learning python speech-synthesis text-to-speech transformers tts voice-cloning
Language:Python 86
resemble-ai / resemble-alexa
This is sample code for an Alexa skill that uses realistic voice cloning powered by Resemble AI's text-to-speech API, and Open AI’s GPT-3 AI engine.
alexa artificial-intelligence assistant-chat-bots text-to-speech tts voice voice-cloning
Language:Python 85
lukaszliniewicz / Pandrator
Pandrator aspires to be a user-friendly app with a graphical interface and a one-click installer that creates high-quality speech from text in multiple languages (audiobooks, speech synchronised with subtitles and more) using local models (XTTS, Silero or VoiceCraft), plus voice cloning, LLM pre-processing, RVC enhancement, and automatic evaluation
audiobook audiobook-creator audiobook-maker audiobooks text-processing text-to-speech customtkinterprojects llm nisqa rvc tkinter-gui xtts xttsv2 silero voice-cloning voicecraft dubbing pdf-to-audio subtitle-to-speech subtitle-to-voice
Language:Python 77
AdiKsOnDev / YouTranslate
Takes a youtube video, clones the voice and re-creates that video in a different language
ai elevenlabs-api localization-tool translation voice-cloning voice-recognition youtube collaborate github
Language:Python 65
everydaycodings / MimicMania
MimicMania is a web application that allows you to generate speech and clone voices using text-to-speech technology. With MimicMania, you can create custom voices in a variety of languages and use them for a range of applications, from voiceovers to chatbots.
cloning jspeech python streamlit tacotron text-to-speech tts voice-cloning hacktoberfest
Language:Python 59
0417keito / VALL-E-X-Trainer-by-CustomData
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
singing-voice-synthesis text-to-speech tts vall-e voice-cloning
Language:Python 58
hparcells / rtvc
💬 "Realtime" voice transcription and cloning using ElevenLabs's API.
ai elevenlabs voice-cloning api website voice-synthesis voicecloning interactive transcription speech-to-speech web
Language:TypeScript 47

voice-cloning

CorentinJ / Real-Time-Voice-Cloning

coqui-ai / TTS

RVC-Boss / GPT-SoVITS

PaddlePaddle / PaddleSpeech

BenAAndrew / Voice-Cloning-App

coqui-ai / open-speech-corpora

IAHispano / Applio

gitmylo / audio-webui

Tomiinek / Multilingual_Text_to_Speech

wladradchenko / wunjo.wladradchenko.ru

PaddlePaddle / Parakeet

gitmylo / bark-voice-cloning-HuBERT-quantizer

PlayVoice / lora-svc

jackaduma / CycleGAN-VC2

SforAiDl / Neural-Voice-Cloning-With-Few-Samples

vlomme / Multi-Tacotron-Voice-Cloning

deterministic-algorithms-lab / Cross-Lingual-Voice-Cloning

Sharad24 / Neural-Voice-Cloning-with-Few-Samples

CMsmartvoice / One-Shot-Voice-Cloning

dunky11 / voicesmith

pranauv1 / AI-Video-Translation

BoltzmannEntropy / xtts2-ui

FlorianEagox / WeeaBlind

smoke-trees / Voice-synthesis

simsax / Voice_cloner

SayanoAI / RVC-Studio

jackaduma / CycleGAN-VC3

kanttouchthis / text_generation_webui_xtts

AIFSH / ComfyUI-GPT_SoVITS

sidharthrajaram / StyleTTS2

resemble-ai / resemble-alexa

lukaszliniewicz / Pandrator

AdiKsOnDev / YouTranslate

everydaycodings / MimicMania

0417keito / VALL-E-X-Trainer-by-CustomData

hparcells / rtvc