houxuedong's starred repositories
syncnet_python
Out of time: automated lip sync in the wild
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
LAION-Face
The human face subset of LAION-400M for large-scale face pretraining.
InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
StoryImager
StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion
GaussianTalker
Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
diff-sampler
[CVPR-2024, Highlight, Top 2.8%] Official implementation for "Fast ODE-based Sampling for Diffusion Models in Around 5 Steps".
Parts2Whole
[Arxiv 2024] From Parts to Whole: A Unified Reference Framework for Controllable Human Image Generation