yasyune's starred repositories

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:27661Issues:184Issues:877

OpenVoice

Instant voice cloning by MyShell.

Language:PythonLicense:MITStargazers:26829Issues:207Issues:198

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10061Issues:103Issues:140

StreamDiffusion

StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation

Language:PythonLicense:Apache-2.0Stargazers:9180Issues:77Issues:101

clone-voice

A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频

Language:PythonLicense:NOASSERTIONStargazers:6602Issues:35Issues:116

StyleTTS2

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

Language:PythonLicense:MITStargazers:4302Issues:79Issues:167

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4123Issues:55Issues:120

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonLicense:MITStargazers:1116Issues:57Issues:45

resemble-enhance

AI powered speech denoising and enhancement

Language:PythonLicense:MITStargazers:1041Issues:15Issues:31

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Language:PythonLicense:AGPL-3.0Stargazers:579Issues:14Issues:89
Language:PythonLicense:Apache-2.0Stargazers:248Issues:13Issues:15

megatts2

Unoffical implementation of Megatts2

Language:PythonLicense:MITStargazers:234Issues:22Issues:19

EasyBertVits2

文章から感情豊かな音声を生成する Bert-VITS2 を簡単に使えます。

Language:BatchfileLicense:MITStargazers:134Issues:4Issues:4

Aivis

💠 Aivis: AI Voice Imitation System

Language:PythonLicense:MITStargazers:126Issues:3Issues:2
Language:PythonLicense:MITStargazers:78Issues:5Issues:4

APNet2

Source code of APNet2, a vocoder

Language:PythonLicense:MITStargazers:46Issues:2Issues:2

Applio-Installer

Create, Experiment, Enjoy with Applio: Now Easier, Simpler and Faster!

Language:HTMLStargazers:45Issues:3Issues:0

descript-audio-vae

VAE modified from Descript Audio Codec, which replaces the RVQ with VAE

Language:PythonLicense:MITStargazers:39Issues:6Issues:1

Bert-VITS2-Audio-Generator

GUI TTS Application based on Bert-VITS2

Language:PythonStargazers:27Issues:2Issues:0

Aivis-Dataset

💠 Aivis: AI Voice Imitation System

Language:PythonLicense:MITStargazers:25Issues:2Issues:0
Language:PythonLicense:MITStargazers:25Issues:1Issues:0

PL-Bert-VITS2

VITS2 using Phoneme-Level Japanese BERT

Language:PythonLicense:Apache-2.0Stargazers:12Issues:3Issues:1

RVC_Onnx_Infer

RVC Onnx Infer- Upgraded and simplified-ish

Language:PythonStargazers:11Issues:4Issues:0

Bert-VITS2-JP

with shell script setup

Language:PythonLicense:AGPL-3.0Stargazers:5Issues:0Issues:0

rvc-onnx-test

for onnx export test from rvc

Language:PythonLicense:MITStargazers:4Issues:3Issues:0

RVC_Onnx_Infer

RVC Onnx Infer- Upgraded and simplified-ish

Language:PythonStargazers:2Issues:1Issues:0

Retrieval-based-Voice-Conversion-WebUI

Use less than 10 minutes vocal to fast train a voice conversion model!

Language:PythonLicense:MITStargazers:1Issues:0Issues:0