yasyune

yasyune's starred repositories

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language: Python MIT 32112 196 1162

OpenVoice

Instant voice cloning by MIT and MyShell.

Language: Python MIT 28261 211 227

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

Apache-2.0 14244 671 90

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language: Python BSD-3-Clause 10351 103 145

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language: Python Apache-2.0 9436 78 114

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio

Language: Python NOASSERTION 7011 38 125

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python MIT 4428 58 151

WhisperSpeech

An Open Source text-to-speech system built by inverting Whisper.

Language: Jupyter Notebook MIT 3750 75 100

metavoice-src

Foundational model for human-like, expressive TTS

Language: Python Apache-2.0 3694 77 123

resemble-enhance

AI powered speech denoising and enhancement

Language: Python MIT 1232 16 43

open-tts-tracker

DragonianVoice

A C++ inference library for multiple SVC/TTS models

Language: C AGPL-3.0 981 17 45

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Language: Python AGPL-3.0 680 14 103

megatts2

Unofficial implementation of Megatts2

Language: Python MIT 252 23 20

NeuCoSVC

Language: Python 238 7 7

EasyBertVits2

Easily use Bert-VITS2, which generates emotionally expressive speech from text.

Language: Batchfile MIT 134 4 4

Aivis

💠 Aivis: AI Voice Imitation System

Language: Python MIT 129 3 2

FCPE

Language: Python MIT 90 5 5

Applio-Installer

Create, Experiment, Enjoy with Applio: Now Easier, Simpler and Faster!

Language: HTML 48 30

descript-audio-vae

VAE modified from Descript Audio Codec, which replaces the RVQ with VAE

Language: Python MIT 42 8 1

Hifi-vaegan

Language: Python AGPL-3.0 38 50

slice-and-transcribe

Language: Python MIT 26 10

Aivis-Dataset

💠 Aivis: AI Voice Imitation System

Language: Python MIT 25 20

fastersvc

Language: Python 2500

RVC_Onnx_Infer

RVC Onnx Infer- Upgraded and simplified-ish

Language: Python MIT 19 4 3

Style-Bert-VITS2-DiscordBot

A Discord text-to-speech bot that reads messages aloud using BertVITS2

Language: Python MIT 1700

PL-Bert-VITS2

VITS2 using Phoneme-Level Japanese BERT

Language: Python Apache-2.0 12 3 1

rvc-onnx-test

for onnx export test from rvc

Language: Python MIT 4 30

RVC_Onnx_Infer

RVC Onnx Infer- Upgraded and simplified-ish

Language: Python 2 10

Retrieval-based-Voice-Conversion-WebUI

Quickly train a voice conversion model with less than 10 minutes of vocal data!

Language: Python MIT 100