yasyune's starred repositories
GPT-SoVITS
As little as 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
clone-voice
A voice cloning tool with a web interface that uses your own voice or any other sound to record audio
HierSpeechpp
The official implementation of HierSpeech++
resemble-enhance
AI-powered speech denoising and enhancement
Style-Bert-VITS2
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
EasyBertVits2
Easily use Bert-VITS2, which generates emotionally expressive speech from text.
Applio-Installer
Create, Experiment, Enjoy with Applio: Now Easier, Simpler and Faster!
descript-audio-vae
A VAE adapted from Descript Audio Codec, replacing the RVQ with a VAE
Bert-VITS2-Audio-Generator
GUI TTS Application based on Bert-VITS2
Aivis-Dataset
💠 Aivis: AI Voice Imitation System
PL-Bert-VITS2
VITS2 using Phoneme-Level Japanese BERT
RVC_Onnx_Infer
RVC ONNX Infer: upgraded and simplified-ish
Bert-VITS2-JP
Bert-VITS2-JP with a shell script setup
rvc-onnx-test
Test of ONNX export from RVC
Retrieval-based-Voice-Conversion-WebUI
Quickly train a voice conversion model with less than 10 minutes of vocal data!