Arking1995's starred repositories

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonLicense:Apache-2.0Stargazers:8004Issues:56Issues:1493

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5875Issues:47Issues:77

CogVideo

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Language:PythonLicense:Apache-2.0Stargazers:5665Issues:107Issues:104

mmpose

OpenMMLab Pose Estimation Toolbox and Benchmark.

Language:PythonLicense:Apache-2.0Stargazers:5472Issues:55Issues:1449

InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Language:PythonLicense:MITStargazers:5057Issues:49Issues:465

kubric

A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2258Issues:42Issues:186

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

sd-akashic

A compendium of informations regarding Stable Diffusion (SD)

OMG-Seg

OMG-LLaVA and OMG-Seg codebase

Language:PythonLicense:NOASSERTIONStargazers:1185Issues:23Issues:28

Bunny

A family of lightweight multimodal models.

Language:PythonLicense:Apache-2.0Stargazers:853Issues:19Issues:107

omni3d

Code release for "Omni3D A Large Benchmark and Model for 3D Object Detection in the Wild"

Language:PythonLicense:NOASSERTIONStargazers:698Issues:23Issues:51

objaverse-xl

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

Language:PythonLicense:Apache-2.0Stargazers:684Issues:8Issues:46

CityDreamer

The official implementation of "CityDreamer: Compositional Generative Model of Unbounded 3D Cities". (Xie et al., CVPR 2024)

Language:PythonLicense:NOASSERTIONStargazers:585Issues:26Issues:23

conceptual-captions

Conceptual Captions is a dataset containing (image-URL, caption) pairs designed for the training and evaluation of machine learned image captioning systems.

Language:ShellLicense:NOASSERTIONStargazers:511Issues:18Issues:19

ADE20K

ADE20K Dataset

Language:Jupyter NotebookStargazers:305Issues:24Issues:43

MagicBrush

[NeurIPS'23] "MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing".

Language:PythonLicense:NOASSERTIONStargazers:291Issues:5Issues:17

libcom

Image composition toolbox: everything you want to know about image composition or object insertion

Language:PythonLicense:Apache-2.0Stargazers:266Issues:12Issues:35

vision-language-models-are-bows

Experiments and data for the paper "When and why vision-language models behave like bags-of-words, and what to do about it?" Oral @ ICLR 2023

Language:PythonLicense:MITStargazers:227Issues:8Issues:37

T2I-CompBench

[Neurips 2023] T2I-CompBench: A Comprehensive Benchmark for Open-world Compositional Text-to-image Generation

Language:PythonLicense:MITStargazers:180Issues:2Issues:19

SyntheticData

Is synthetic data from generative models ready for image recognition?

Language:PythonLicense:Apache-2.0Stargazers:172Issues:13Issues:9

National_interest_waiver_waittime

USCIS Employment-based-2 national interest waiver wait time

MEBOW

Code for "MEBOW: Monocular Estimation of Body Orientation In the Wild", CVPR 2020

DST3D

Official implementation of "Generating images with 3D annotations using diffusion models".

Language:PythonLicense:MITStargazers:56Issues:12Issues:0

LLaVA-1.6-ft

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:27Issues:2Issues:0

Super-CLEVR

Code for paper "Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning"

Language:PythonLicense:NOASSERTIONStargazers:20Issues:3Issues:1
Language:PythonLicense:NOASSERTIONStargazers:18Issues:3Issues:4

imagenet3d

ImageNet3D: Towards General-Purpose Object-Level 3D Understanding

Language:PythonStargazers:13Issues:2Issues:0