Huanyu's repositories
VASA-1
VASA-1
GPT-SoVITS
One minute of voice data is enough to train a good TTS model (few-shot voice cloning).
OpenFace
OpenFace – a state-of-the-art tool for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
OpenVoice
Instant voice cloning by MyShell.
ComfyUI
The most powerful and modular Stable Diffusion GUI with a graph/nodes interface.
generative-ai-for-beginners
12 lessons to get started building with generative AI: https://microsoft.github.io/generative-ai-for-beginners/
RasaGPT
💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built with Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, and Telegram.
GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
stable-diffusion-webui
Stable Diffusion web UI
aigc-controller
Controls AIGC background tasks
aigc-video-api
Manages the background processes for PPT-to-video conversion
stable-diffusion
A latent text-to-image diffusion model
multidiffusion-upscaler-for-automatic1111
Tiled Diffusion and VAE optimization, licensed under CC BY-NC-SA 4.0
so-vits-svc
SoftVC VITS Singing Voice Conversion
alpaca.cpp
Locally run an Instruction-Tuned Chat-Style LLM
wechatGPT
Conversational RPA SDK for Chatbot Makers
whisper
Robust Speech Recognition via Large-Scale Weak Supervision
FFmpeg
Mirror of https://git.ffmpeg.org/ffmpeg.git
mw-best-practices
Midway.js best practices for Node.js full-stack development
scrapy-proxy
Scrapy with proxy support for large projects