Yue Cao's starred repositories
stable-diffusion-webui
Stable Diffusion web UI
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
supervision
We write your reusable computer vision tools. 💜
StableCascade
Official Code for Stable Cascade
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
openblocks
🔥 🔥 🔥 The Open Source Retool Alternative
DeepSeek-Coder
DeepSeek Coder: Let the Code Write Itself
multidiffusion-upscaler-for-automatic1111
Tiled Diffusion and VAE optimize, licensed under CC BY-NC-SA 4.0
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
consistencydecoder
Consistency Distilled Diff VAE
DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
Chat-Haruhi-Suzumiya
Chat凉宫春日, An open sourced Role-Playing chatbot Cheng Li, Ziang Leng, and others.
Neural-Network-Diffusion
We introduce a novel approach for parameter generation, named neural network diffusion (\textbf{p-diff}, p stands for parameter), which employs a standard latent diffusion model to synthesize a new set of parameters
Make-An-Audio
PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model
disco-diffusion-wrapper
Implementation of disco-diffusion wrapper that could run on your own GPU with batch text input.
gptstore-prompts
Here are the Top 100 prompts on GPTStore, which we can use to learn and improve prompt engineering.
ring-flash-attention
Ring attention implementation with flash attention
AniPortraitGAN
This is a pytorch implementation of the following paper: AniPortraitGAN: Animatable 3D Portrait Generation from 2D Image Collections, SIGGRAPH Asia 2023.