Rice Cake's starred repositories

ComfyUI

The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:43692Issues:342Issues:2592

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:29993Issues:190Issues:990

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:28454Issues:168Issues:424

1Panel

🔥🔥🔥 Web-based linux server management control panel. / 现代化、开源的 Linux 服务器运维管理面板。

Language:GoLicense:GPL-3.0Stargazers:20797Issues:161Issues:3405

shap-e

Generate 3D objects conditioned on text or images

Language:PythonLicense:MITStargazers:11478Issues:239Issues:112

go-proxy-bingai

用 Vue3 和 Go 搭建的微软 New Bing 演示站点,拥有一致的 UI 体验,支持 ChatGPT 提示词,国内可用。

Language:HTMLLicense:MITStargazers:8870Issues:54Issues:379

Bert-VITS2

vits2 backbone with multilingual-bert

Language:PythonLicense:AGPL-3.0Stargazers:7596Issues:45Issues:0

fish-speech

Brand new TTS solution

Language:PythonLicense:NOASSERTIONStargazers:6630Issues:58Issues:270

openai-scf-proxy

使用腾讯云函数一分钟搭建 OpenAI 免翻墙代理

hifi-gan

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Language:PythonLicense:MITStargazers:1855Issues:32Issues:160

MoeTTS

Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc

Style-Bert-VITS2

Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.

Language:PythonLicense:AGPL-3.0Stargazers:637Issues:14Issues:97

vits2

VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design

Language:Jupyter NotebookLicense:MITStargazers:433Issues:13Issues:14

Bert-VITS2-UI

BertVITS2前端界面

Language:VueLicense:AGPL-3.0Stargazers:279Issues:3Issues:15

GalgameReverse

Reverse Projects for Galgame

ar-vits

text to speech using autoregressive transformer and VITS

Language:PythonLicense:MITStargazers:216Issues:15Issues:4

audio-preprocess

Preprocess Audio for training

Language:PythonLicense:Apache-2.0Stargazers:201Issues:8Issues:6

MoeSR

An application specialized in image super-resolution for ACGN illustrations and Visual Novel CG. 专注于插画/Galgame CG等ACGN领域的图像超分辨率的应用

Language:JavaScriptLicense:GPL-3.0Stargazers:177Issues:0Issues:4

ColorSplitter

A cli tool for split vocal timbre.

Language:PythonLicense:MITStargazers:147Issues:0Issues:3

BGIKit

Script Decoder and Encoder for Ethornell Buriko General Interpreter Galgame Engine, Short as BGI

SOFA

SOFA: Singing-Oriented Forced Aligner

Language:PythonLicense:MITStargazers:93Issues:5Issues:9
Language:PythonLicense:MITStargazers:86Issues:5Issues:5

LangSegment

It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool. 强大的TTS多语言(97种语言)混合文本内容自动分词工具。

Language:PythonStargazers:52Issues:2Issues:0

MagVITS

VITS with phoneme-level prosody modeling based on MaskGIT

Stargazers:34Issues:0Issues:0

SoftPal-Tool

SoftPal Engine script disassembly and editing tool.

Language:PythonLicense:GPL-3.0Stargazers:15Issues:1Issues:0

Irotorinosekai_18

色鸟鸟移植英版r18补丁到日版/汉化版

Language:C++Stargazers:5Issues:2Issues:0

bilibiliSearchAI

B站内测的搜索AI(类 newBing)逆向接口

Language:GoLicense:MITStargazers:4Issues:1Issues:0