iris0329

Iris's starred repositories

ControlNet

Let us control diffusion models!

Language:PythonApache-2.029405 217 534

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonApache-2.024639 193 3917

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonApache-2.08001 57 1492

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonApache-2.05765 66 414

pyllama

LLaMA: Open and Efficient Foundation Language Models

Language:PythonGPL-3.02801 34 93

zero123

Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)

Language:PythonMIT2631 43 124

Semantic-Segment-Anything

Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).

Language:PythonApache-2.02077 19 57

Open3D-ML

An extension of Open3D to address 3D Machine Learning tasks

Language:PythonNOASSERTION1783 46 290

CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Language:PythonMIT1636 15 78

Diffusion-Models-pytorch

Pytorch implementation of Diffusion Models (https://arxiv.org/pdf/2006.11239.pdf)

Language:PythonApache-2.01066 11 40

Monocular-Depth-Estimation-Toolbox

Monocular Depth Estimation Toolbox based on MMSegmentation.

Language:PythonApache-2.0896 15 97

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

769 24 12

GroupViT

Official PyTorch implementation of GroupViT: Semantic Segmentation Emerges from Text Supervision, CVPR 2022.

Language:PythonNOASSERTION716 11 64

GaussianDreamer

GaussianDreamer: Fast Generation from Text to 3D Gaussians by Bridging 2D and 3D Diffusion Models (CVPR 2024)

Language:PythonApache-2.0611 15 40

DenseCLIP

[CVPR 2022] DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting

Language:Python504 3 53

Scan2CAD

[CVPR'19] Dataset and code used in the research project Scan2CAD: Learning CAD Model Alignment in RGB-D Scans

Language:C++NOASSERTION425 26 28

LLM-groundedDiffusion

LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models (LLM-grounded Diffusion: LMD)

Language:Python397 13 18

Make-A-Protagonist

Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts

Language:PythonApache-2.0316 2 23

ChatSim

[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration

Language:Python292 14 41

HumanoidAgents

Humanoid Agents: Platform for Simulating Human-like Generative Agents

Language:PythonApache-2.0248 4 2

RVT

Official Code for RVT-2 and RVT

Language:Jupyter NotebookNOASSERTION246 10 46

ScanRefer

[ECCV 2020] ScanRefer: 3D Object Localization in RGB-D Scans using Natural Language

Language:PythonNOASSERTION225 9 26

controlvideo

Official implementation for "ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing"

Language:PythonApache-2.0216 18 15

DiT-3D

🔥🔥🔥Official Codebase of "DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation"

Language:PythonApache-2.0204 12 21

DDP

Language:Python159 9 15

RayDF

🔥RayDF in PyTorch (NeurIPS 2023)

Language:PythonNOASSERTION104 3 9

Scan2Cap

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Language:PythonNOASSERTION99 7 23

3DVL_Codebase

[CVPR2022 Oral] 3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds

Language:PythonNOASSERTION50 3 10

Open-Vocabulary-Affordance-Detection-in-3D-Point-Clouds

[IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds

Language:PythonMIT47 1 10

3D-VLP

This is the code related to "Context-aware Alignment and Mutual Masking for 3D-Language Pre-training" (CVPR 2023).

Language:PythonMIT24 5 3