Iris's starred repositories
ControlNet
Let us control diffusion models!
Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Diffusion-Models-pytorch
PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)
Monocular-Depth-Estimation-Toolbox
Monocular Depth Estimation Toolbox based on MMSegmentation.
Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
GaussianDreamer
GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)
LLM-groundedDiffusion
LLM-grounded Diffusion (LMD): Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Make-A-Protagonist
Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts
HumanoidAgents
Humanoid Agents: Platform for Simulating Human-like Generative Agents
controlvideo
Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"
3DVL_Codebase
[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds
Open-Vocabulary-Affordance-Detection-in-3D-Point-Clouds
[IROS 2023] Open-Vocabulary Affordance Detection in 3D Point Clouds