skskgrowl's starred repositories

openvla

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Language: Python | License: MIT | Stargazers: 1309 | Issues: 22 | Issues: 141

visualnav-transformer

Official code and checkpoint release for mobile robot foundation models: GNM, ViNT, and NoMaD.

Language: Python | License: MIT | Stargazers: 598 | Issues: 32 | Issues: 48

Awesome-Robotics-3D

A curated list of 3D vision papers related to robotics in the era of large models (i.e., LLMs/VLMs), inspired by awesome-computer-vision, including papers, code, and related websites.

Awesome-World-Model

A collection of papers on world models for autonomous driving.

OccWorld

[ECCV 2024] 3D World Model for Autonomous Driving

Language: Python | License: Apache-2.0 | Stargazers: 373 | Issues: 9 | Issues: 28

Drive-WM

[CVPR 2024] A world model for autonomous driving.

Language: Python | License: Apache-2.0 | Stargazers: 309 | Issues: 22 | Issues: 5

ViDAR

[CVPR 2024 Highlight] Visual Point Cloud Forecasting

Language: Python | License: Apache-2.0 | Stargazers: 278 | Issues: 9 | Issues: 42

3D-Occupancy-Perception

[Information Fusion 2024] A Survey on Occupancy Perception for Autonomous Driving: The Information Fusion Perspective

GaussianOcc

GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting

Language: Python | License: Apache-2.0 | Stargazers: 207 | Issues: 11 | Issues: 26

Cam4DOcc

[CVPR 2024] Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications

Language: Python | License: MIT | Stargazers: 203 | Issues: 11 | Issues: 17

PaSCo

[CVPR 2024 Oral, Best Paper Award Candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"

Language: Python | License: Apache-2.0 | Stargazers: 168 | Issues: 12 | Issues: 23

NavGPT

[AAAI 2024] Official implementation of NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models

Language: Python | License: MIT | Stargazers: 149 | Issues: 2 | Issues: 11

ELM

[ECCV 2024] Embodied Understanding of Driving Scenarios

pyramid-discrete-diffusion

Official implementation of the paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)

Language: Python | License: MIT | Stargazers: 104 | Issues: 8 | Issues: 9

LAW

Enhancing End-to-End Autonomous Driving with Latent World Model

osp

[ECCV 2024] Occupancy as Set of Points

Language: Python | License: MIT | Stargazers: 81 | Issues: 6 | Issues: 6

NavGPT-2

[ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Language: Python | License: MIT | Stargazers: 79 | Issues: 7 | Issues: 5

Coopernaut

Coopernaut: End-to-End Driving with Cooperative Perception for Networked Vehicles

Language: Jupyter Notebook | Stargazers: 79 | Issues: 2 | Issues: 13

MM-VUFM4DS

A systematic survey of multi-modal and multi-task visual understanding foundation models for driving scenarios

Language: Jupyter Notebook | License: MIT | Stargazers: 42 | Issues: 3 | Issues: 2

HTCL

Official PyTorch Implementation of HTCL (ECCV 2024): Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion

Language: Python | License: Apache-2.0 | Stargazers: 34 | Issues: 3 | Issues: 3

CoHFF

[CVPR 2024] Collaborative Semantic Occupancy Prediction with Hybrid Feature Fusion in Connected Automated Vehicles

MambaOcc

MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering

C-Instructor

[ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting

MARL-CCE

[ECCV 2024] Official implementation of "Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model"

License: BSD-3-Clause | Stargazers: 5 | Issues: 0 | Issues: 0