Jing He's starred repositories

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:5765Issues:88Issues:140

IC-Light

More relighting!

Language:PythonLicense:Apache-2.0Stargazers:4839Issues:47Issues:77

IDM-VTON

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:2184Issues:41Issues:91

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter NotebookStargazers:1559Issues:21Issues:36

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language:PythonLicense:NOASSERTIONStargazers:848Issues:40Issues:42

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:702Issues:11Issues:38

GeoWizard

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

DSINE

[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:681Issues:10Issues:14

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language:PythonLicense:Apache-2.0Stargazers:374Issues:22Issues:22

Video-P2P

Video-P2P: Video Editing with Cross-attention Control

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Language:PythonLicense:Apache-2.0Stargazers:359Issues:22Issues:26

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language:PythonLicense:MITStargazers:337Issues:12Issues:15

CosmicMan

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

CAT-Seg

Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"

Language:PythonLicense:MITStargazers:241Issues:6Issues:36

probe3d

[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models

Language:PythonLicense:MITStargazers:237Issues:5Issues:7
Language:PythonLicense:Apache-2.0Stargazers:186Issues:30Issues:2

SpeeD

SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

Language:PythonLicense:Apache-2.0Stargazers:148Issues:9Issues:7

ElasticDiffusion-official

The official Pytorch Implementation for ElasticDiffusion: Training-free Arbitrary Size Image Generation (CVPR 2024)

GenPercept

GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models

Language:PythonLicense:BSD-2-ClauseStargazers:114Issues:5Issues:11

OIR

[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"

svd-mv

Unofficial Implementation of "Stable Video Diffusion Multi-View"

Language:PythonLicense:MITStargazers:73Issues:5Issues:2

TrackDiffusion

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)

GeoDiffusion

Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)

Language:PythonLicense:MITStargazers:55Issues:4Issues:20

dmp

[CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction

Language:PythonLicense:Apache-2.0Stargazers:51Issues:6Issues:7

TADP

Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024

Language:PythonLicense:Apache-2.0Stargazers:17Issues:5Issues:4