yasyune's starred repositories

GPT-SoVITS

As little as 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)

Language: Python | License: MIT | Stargazers: 32112 | Issues: 196 | Issues: 1162

OpenVoice

Instant voice cloning by MIT and MyShell.

Language: Python | License: MIT | Stargazers: 28261 | Issues: 211 | Issues: 227

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language: Python | License: BSD-3-Clause | Stargazers: 10351 | Issues: 103 | Issues: 145

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language: Python | License: Apache-2.0 | Stargazers: 9436 | Issues: 78 | Issues: 114

clone-voice

A voice cloning tool with a web interface that uses your own voice or any other sound to record audio.

Language: Python | License: NOASSERTION | Stargazers: 7011 | Issues: 38 | Issues: 125

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language: Python | License: MIT | Stargazers: 4428 | Issues: 58 | Issues: 151

WhisperSpeech

An open-source text-to-speech system built by inverting Whisper.

Language: Jupyter Notebook | License: MIT | Stargazers: 3750 | Issues: 75 | Issues: 100

metavoice-src

Foundational model for human-like, expressive TTS

Language: Python | License: Apache-2.0 | Stargazers: 3694 | Issues: 77 | Issues: 123

resemble-enhance

AI-powered speech denoising and enhancement

Language: Python | License: MIT | Stargazers: 1232 | Issues: 16 | Issues: 43

DragonianVoice

A C++ inference library for multiple SVC/TTS models

Language: C | License: AGPL-3.0 | Stargazers: 981 | Issues: 17 | Issues: 45

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Language: Python | License: AGPL-3.0 | Stargazers: 680 | Issues: 14 | Issues: 103

megatts2

Unofficial implementation of Megatts2

Language: Python | License: MIT | Stargazers: 252 | Issues: 23 | Issues: 20

EasyBertVits2

Makes it easy to use Bert-VITS2, which generates emotionally expressive speech from text.

Language: Batchfile | License: MIT | Stargazers: 134 | Issues: 4 | Issues: 4

Aivis

💠 Aivis: AI Voice Imitation System

Language: Python | License: MIT | Stargazers: 129 | Issues: 3 | Issues: 2
Language: Python | License: MIT | Stargazers: 90 | Issues: 5 | Issues: 5

Applio-Installer

Create, Experiment, Enjoy with Applio: Now Easier, Simpler and Faster!

Language: HTML | Stargazers: 48 | Issues: 3 | Issues: 0

descript-audio-vae

A VAE modified from Descript Audio Codec that replaces the RVQ with a VAE

Language: Python | License: MIT | Stargazers: 42 | Issues: 8 | Issues: 1
Language: Python | License: AGPL-3.0 | Stargazers: 38 | Issues: 5 | Issues: 0
Language: Python | License: MIT | Stargazers: 26 | Issues: 1 | Issues: 0

Aivis-Dataset

💠 Aivis: AI Voice Imitation System

Language: Python | License: MIT | Stargazers: 25 | Issues: 2 | Issues: 0
Language: Python | Stargazers: 25 | Issues: 0 | Issues: 0

RVC_Onnx_Infer

RVC Onnx Infer- Upgraded and simplified-ish

Language: Python | License: MIT | Stargazers: 19 | Issues: 4 | Issues: 3

Style-Bert-VITS2-DiscordBot

A Discord text-to-speech bot that reads messages aloud using BertVITS2

Language: Python | License: MIT | Stargazers: 17 | Issues: 0 | Issues: 0

PL-Bert-VITS2

VITS2 using Phoneme-Level Japanese BERT

Language: Python | License: Apache-2.0 | Stargazers: 12 | Issues: 3 | Issues: 1

rvc-onnx-test

For testing ONNX export from RVC

Language: Python | License: MIT | Stargazers: 4 | Issues: 3 | Issues: 0

RVC_Onnx_Infer

RVC Onnx Infer- Upgraded and simplified-ish

Language: Python | Stargazers: 2 | Issues: 1 | Issues: 0

Retrieval-based-Voice-Conversion-WebUI

Quickly train a voice conversion model with less than 10 minutes of vocal data!

Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0