MingJian.L's starred repositories
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Awesome-LLM-3D
Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources
LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
Awesome-Text-to-3D
A growing curation of Text-to-3D, Diffusion-to-3D works.
DriveDreamer
[ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
OmniScient-Model
This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model
distill-bev
DistillBEV: Boosting Multi-Camera 3D Object Detection with Cross-Modal Knowledge Distillation (ICCV 2023)
Lifelong-MonoDepth
About official Pytorch implementation of "Lifelong-MonoDepth: Lifelong Learning for Multi-Domain Monocular Metric Depth Estimation
Active_room_segmentation
Code for Human cognition-inspired active room segmentation
songzhuoran.github.io
My homepage.