yhZhai / idol

[ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

Home Page:https://yhzhai.github.io/idol/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Yuanhao Zhai1, Kevin Lin2, Linjie Li2, Chung-Ching Lin2, Jianfeng Wang2, Zhengyuan Yang2, David Doermann1, Junsong Yuan1, Zicheng Liu3, Lijuan Wang2

1State University of New Yort at Buffalo   |   2Microsoft  |   3Advanced Micro Devices

European Conference on Computer Vision (ECCV) 2024

 

TL;DR: Our IDOL enables human-centric joint video-depth generation, which could be rendered into realistic 2.5 videos.

All code and checkpoints will be released soon!

About

[ECCV 2024] IDOL: Unified Dual-Modal Latent Diffusion for Human-Centric Joint Video-Depth Generation

https://yhzhai.github.io/idol/

License:Apache License 2.0