Kai Jin's starred repositories
HunyuanDiT
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
GPT-SoVITS
One minute of voice data is enough to train a good TTS model (few-shot voice cloning).
StableVITON
[CVPR 2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
FaceStudio
Put Your Face Everywhere in Seconds.
cross_modal_adaptation
Cross-modal few-shot adaptation with CLIP
Deep3DFaceRecon_pytorch
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019). A PyTorch implementation.
generative-models
Generative Models by Stability AI
sd-webui-EasyPhoto
📷 EasyPhoto | Your Smart AI Photo Generator.