TongHengcheng

followers

following

stars

Aire

Hay Kim's repositories

AnimateLCM

AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!

Language:PythonMIT000

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonApache-2.0000

AnyV2V

A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

Language:Jupyter NotebookMIT000

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

000

BrushNet

The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

Language:PythonNOASSERTION000

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonApache-2.0000

ConsistI2V

ConsistI2V: Enhancing Visual Consistency for Image-to-Video Generation

Language:PythonMIT000

ControlNet_Plus_Plus

Inference code for: ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback

Language:PythonMIT000

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language:PythonApache-2.0000

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language:PythonApache-2.0000

FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language:Jupyter NotebookNOASSERTION000

Dough

Dough is a open source tool for steering AI animations with precision.

NOASSERTION000

DynamiCrafter

DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Apache-2.0000

facefusion

Next generation face swapper and enhancer

NOASSERTION000

img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

MIT000

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

000

Lumina-T2X

Lumina-T2X is a model for Text to Any Modality Generation

MIT000

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Apache-2.0000

MiniGemini

Official implementation for Mini-Gemini

Apache-2.0000

MoneyPrinterTurbo

利用大模型，一键生成短视频

MIT000

Monkey

【CVPR 2024】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models

Language:PythonMIT000

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

MIT000

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

MIT000

PowerPaint

000

PuLID

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Apache-2.0000

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

MIT000

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

000

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

000

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"

MIT000

yolov9

Implementation of paper - YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information

Language:PythonGPL-3.0000