haoranD

Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.

idify

Make ID photos right in the browser.

GPL-3.0

MasQCLIP

(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation

NOASSERTION

MosaicFusion

MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation

NOASSERTION

on-isotropy-of-contrastive-SRL

openpilot

openpilot is an open source driver assistance system. openpilot performs the functions of Automated Lane Centering and Adaptive Cruise Control for 250+ supported car makes and models.

Language: Python · MIT

OutfitAnyone

Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person

panacea

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

Apache-2.0

react-native_3d_store

rich-text-to-image

Rich-Text-to-Image Generation

MIT

sd-webui-EasyPhoto

📷 EasyPhoto | Your Smart AI Photo Generator.

Apache-2.0

TimeLlama

The official repo of TimeLlama, an instruction-finetuned Llama2 series that improves complex temporal reasoning ability.

Language: Python · MIT

Tracking-Anything-with-DEVA

[ICCV 2023] Tracking Anything with Decoupled Video Segmentation

vditor

♏ An in-browser Markdown editor that supports WYSIWYG (rich text), instant rendering (Typora-like), and split-view modes.

MIT

waymax

A JAX-based simulator for autonomous driving research.

NOASSERTION

WebODM

User-friendly, commercial-grade software for processing aerial imagery. 🛩

AGPL-3.0