Hay Kim's repositories
AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
AnyV2V
A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
BrushNet
The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
ConsistI2V
ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation
ControlNet_Plus_Plus
Inference code for: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback
Ctrl-Adapter
Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
FRESCO
[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation
Dough
Dough is a open source tool for steering AI animations with precision.
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
facefusion
Next generation face swapper and enhancer
img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
InstantStyle
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
Lumina-T2X
Lumina-T2X is a model for Text to Any Modality Generation
MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
MiniGemini
Official implementation for Mini-Gemini
MoneyPrinterTurbo
利用大模型,一键生成短视频
Monkey
【CVPR 2024】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models
MuseV
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
PuLID
Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
RPG-DiffusionMaster
[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)
StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
yolov9
Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information