Iris's starred repositories

ControlNet

Let us control diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 29405 | Issues: 217 | Issues: 534

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language: Python | License: Apache-2.0 | Stargazers: 24639 | Issues: 193 | Issues: 3917

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | License: Apache-2.0 | Stargazers: 8001 | Issues: 57 | Issues: 1492

CogVLM

A state-of-the-art open visual language model | multimodal pre-trained model

Language: Python | License: Apache-2.0 | Stargazers: 5765 | Issues: 66 | Issues: 414

pyllama

LLaMA: Open and Efficient Foundation Language Models

Language: Python | License: GPL-3.0 | Stargazers: 2801 | Issues: 34 | Issues: 93

zero123

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Language: Python | License: MIT | Stargazers: 2631 | Issues: 43 | Issues: 124

Semantic-Segment-Anything

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Language: Python | License: Apache-2.0 | Stargazers: 2077 | Issues: 19 | Issues: 57

Open3D-ML

An extension of Open3D to address 3D Machine Learning tasks

Language: Python | License: NOASSERTION | Stargazers: 1783 | Issues: 46 | Issues: 290

CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Language: Python | License: MIT | Stargazers: 1636 | Issues: 15 | Issues: 78

Diffusion-Models-pytorch

PyTorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)

Language: Python | License: Apache-2.0 | Stargazers: 1066 | Issues: 11 | Issues: 40

Monocular-Depth-Estimation-Toolbox

Monocular Depth Estimation Toolbox based on MMSegmentation.

Language: Python | License: Apache-2.0 | Stargazers: 896 | Issues: 15 | Issues: 97

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

GroupViT

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

Language: Python | License: NOASSERTION | Stargazers: 716 | Issues: 11 | Issues: 64

GaussianDreamer

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

Language: Python | License: Apache-2.0 | Stargazers: 611 | Issues: 15 | Issues: 40

DenseCLIP

[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Scan2CAD

[CVPR'19] Dataset and code used in the research project Scan2CAD: Learning CAD Model Alignment in RGB-D Scans

Language: C++ | License: NOASSERTION | Stargazers: 425 | Issues: 26 | Issues: 28

LLM-groundedDiffusion

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)

Make-A-Protagonist

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

Language: Python | License: Apache-2.0 | Stargazers: 316 | Issues: 2 | Issues: 23

ChatSim

[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration

HumanoidAgents

Humanoid Agents: Platform for Simulating Human-like Generative Agents

Language: Python | License: Apache-2.0 | Stargazers: 248 | Issues: 4 | Issues: 2

RVT

Official Code for RVT-2 and RVT

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 246 | Issues: 10 | Issues: 46

ScanRefer

[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

Language: Python | License: NOASSERTION | Stargazers: 225 | Issues: 9 | Issues: 26

controlvideo

Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"

Language: Python | License: Apache-2.0 | Stargazers: 216 | Issues: 18 | Issues: 15

DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Language: Python | License: Apache-2.0 | Stargazers: 204 | Issues: 12 | Issues: 21

RayDF

🔥RayDF in PyTorch (NeurIPS 2023)

Language: Python | License: NOASSERTION | Stargazers: 104 | Issues: 3 | Issues: 9

Scan2Cap

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Language: Python | License: NOASSERTION | Stargazers: 99 | Issues: 7 | Issues: 23

3DVL_Codebase

[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds

Language: Python | License: NOASSERTION | Stargazers: 50 | Issues: 3 | Issues: 10

Open-Vocabulary-Affordance-Detection-in-3D-Point-Clouds

[IROS 2023] Open-Vocabulary Affordance Detection in 3D Point Clouds

Language: Python | License: MIT | Stargazers: 47 | Issues: 1 | Issues: 10

3D-VLP

This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).

Language: Python | License: MIT | Stargazers: 24 | Issues: 5 | Issues: 3