BestSonny

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language: Python | License: CC-BY-4.0 | 1177 14 119

VideoLLaMA2

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Language: Python | License: Apache-2.0 | 776 8 84

humanoid-gym

Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer https://arxiv.org/abs/2404.05695

Language: Python | 713 13 28

S3Gaussian

Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving

Language: Python | License: NOASSERTION | 404 12 24

dreamerv3-torch

Implementation of Dreamer v3 in PyTorch.

Language: Python | License: MIT | 394 11 58

madrona

Language: C++ | License: MIT | 309 9 11

nocturne

A data-driven, fast driving simulator for multi-agent coordination under partial observability.

Language: Python | License: MIT | 261 13 45

Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

227 7 1

LLMTSCS

Official code for article "LLMLight: Large Language Models as Traffic Signal Control Agents".

Language: Python | 158 5 20

World-Models-Autonomous-Driving-Latest-Survey

A curated list of world models for autonomous driving, kept up to date.

154 90

DriveMLM

144 18 5

twm

Transformer-based World Models

Language: Python | License: MIT | 68 5 4

Awesome-Papers-World-Models-Autonomous-Driving

Awesome Papers about World Models in Autonomous Driving

64 5 1

lavad

Official implementation of "Harnessing Large Language Models for Training-free Video Anomaly Detection", CVPR 2024

Language: Python | 49 5 10

world-models-ppo

PyTorch World Model implementation with PPO.

Language: Python | 13 30

CityFlowER

An Efficient and Realistic Traffic Simulator with Embedded Machine Learning Models

Language: Makefile | License: Apache-2.0 | 13 2 1

Dragtraffic

Repo for DragTraffic: Interactive and Controllable Traffic Scene Generation for Autonomous Driving.

Language: Python | 11 10

SEVD

Synthetic Event-based Vision Dataset for Ego and Fixed Traffic Perception

Language: Python | License: Apache-2.0 | 11 4 2

DuaLight

Language: Python | 10 1 3

raceMOP

Official implementation of the paper "RaceMOP: Mapless Online Path Planning for Multi-Agent Autonomous Racing using Residual Policy Learning"

License: GPL-3.0 | 3 10

nixtla

Python SDK for TimeGPT, a foundational time series model

Language: Jupyter Notebook | License: NOASSERTION | 200