Beast code in Giters

Jing He's starred repositories

CVPR2024-Papers-with-Code

CVPR 2024 论文和开源项目合集

17660 288 202

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

11663 268 108

StoryDiffusion

Create Magic Story!

Language:Jupyter NotebookApache-2.05765 88 140

IC-Light

More relighting!

Language:PythonApache-2.04839 47 77

IDM-VTON

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language:Python3590 54 143

Marigold

[CVPR 2024 - Oral, Best Paper Award Candidate] Marigold: Repurposing Diffusion-Based Image Generators for Monocular Depth Estimation

Language:PythonApache-2.02184 41 91

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language:Jupyter Notebook1559 21 36

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language:PythonNOASSERTION848 40 42

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonApache-2.0702 11 38

GeoWizard

[ECCV'24] GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image

Language:Python699 26 34

DSINE

[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation

Language:Jupyter NotebookNOASSERTION681 10 14

Ctrl-Adapter

Official implementation of Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model

Language:PythonApache-2.0374 22 22

Video-P2P

Video-P2P: Video Editing with Cross-attention Control

Language:Python369 9 16

Inf-DiT

Official implementation of Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer

Language:PythonApache-2.0359 22 26

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language:PythonMIT337 12 15

CosmicMan

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

Language:Python301 38 12

CAT-Seg

Official Implementation of "CAT-Seg🐱: Cost Aggregation for Open-Vocabulary Semantic Segmentation"

Language:PythonMIT241 6 36

probe3d

[CVPR 2024] Probing the 3D Awareness of Visual Foundation Models

Language:PythonMIT237 5 7

General-World-Models-Survey

MIT233 120

IntrinsicAnything

Language:PythonApache-2.0186 30 2

SpeeD

SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

Language:PythonApache-2.0148 9 7

ElasticDiffusion-official

The official Pytorch Implementation for ElasticDiffusion: Training-free Arbitrary Size Image Generation (CVPR 2024)

Language:Python138 7 3

behavior-vision-suite.github.io

Language:CSSMIT132 3 7

GenPercept

GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models

Language:PythonBSD-2-Clause114 5 11

OIR

[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"

Language:Python79 9 2

svd-mv

Unofficial Implementation of "Stable Video Diffusion Multi-View"

Language:PythonMIT73 5 2

TrackDiffusion

Official PyTorch implementation of TrackDiffusion (https://arxiv.org/abs/2312.00651)

Language:Python61 6 8

GeoDiffusion

Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)

Language:PythonMIT55 4 20

dmp

[CVPR 2024] Exploiting Diffusion Prior for Generalizable Dense Prediction

Language:PythonApache-2.051 6 7

TADP

Text-Image Alignment for Diffusion-based Perception (TADP) - CVPR 2024

Language:PythonApache-2.017 5 4