博文's starred repositories
ShadowsocksX-NG
Next Generation of ShadowsocksX
GPT-SoVITS
Just 1 minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
Bert-VITS2
VITS2 backbone with multilingual BERT
vocal-separate
An extremely simple tool for separating vocals from background music. It runs entirely locally through a web interface, needs no internet connection, and uses 2stems/4stems/5stems models.
safetensors
Simple, safe way to store and distribute tensors
flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in PyTorch and Triton
text-generation-webui
A Gradio web UI for Large Language Models.
AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
the-super-tiny-compiler
⛄ Possibly the smallest compiler ever
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
self-operating-computer
A framework to enable multimodal models to operate a computer.