zhangjb416's starred repositories
mobile_manipulation_papers
Papers in Mobile Manipulation (Personal Collection)
ScanReason
[ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities
embodied-generalist
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
OmniGibson
OmniGibson: a platform for accelerating Embodied AI research built upon NVIDIA's Omniverse engine. Join our Discord for support: https://discord.gg/bccR5vGFEx
Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Grounded_3D-LLM
Code&Data for Grounded 3D-LLM with Referent Tokens
EmbodiedScan
[CVPR 2024] EmbodiedScan: A Holistic Multi-Modal 3D Perception Suite Towards Embodied AI
SceneTracker
SceneTracker: Long-term Scene Flow Estimation Network
Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
Semantic-Segment-Anything
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
Depth-Anything
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation