2132660698's repositories

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

Awesome-Multi-Camera-3D-Occupancy-Prediction

Awesome papers and code about Multi-Camera 3D Occupancy Prediction, such as TPVFormer, SurroundOcc, PanoOcc, OccFormer, FB-OCC, SelfOcc, COTR, SparseOcc. In this repository, you will see the latest 3D occupancy prediction papers and code.

License:MITStargazers:0Issues:0Issues:0

Bench2Drive

Closed-loop multi-ability evaluation of end-to-end autonomous driving algorithms

License:Apache-2.0Stargazers:0Issues:0Issues:0

Bench2DriveZoo

BEVFormer, UniAD, VAD in CARLA under Closed-Loop Evaluation

Stargazers:0Issues:0Issues:0

BEV-Perception

Bird's Eye View Perception

License:MITStargazers:0Issues:0Issues:0

carla_garage

[ICCV'23] Hidden Biases of End-to-End Driving Models

License:MITStargazers:0Issues:0Issues:0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

License:NOASSERTIONStargazers:0Issues:0Issues:0

CVT-Occ

CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction

Stargazers:0Issues:0Issues:0

Dolphins111

[ECCV 2024] The official code for "Dolphins: Multimodal Language Model for Driving“

License:MITStargazers:0Issues:0Issues:0

DriveLM

[ECCV 2024] DriveLM: Driving with Graph Visual Question Answering

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

firecrawl

🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

License:AGPL-3.0Stargazers:0Issues:0Issues:0

GenAD

[ECCV 2024] GenAD: Generative End-to-End Autonomous Driving

License:Apache-2.0Stargazers:0Issues:0Issues:0

GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

LMDrive

[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models

License:Apache-2.0Stargazers:0Issues:0Issues:0

MindSearch

🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)

License:Apache-2.0Stargazers:0Issues:0Issues:0

OccNet-Course

国内首个占据栅格网络全栈课程《从BEV到Occupancy Network,算法原理与工程实践》,包含端侧部署。Surrounding Semantic Occupancy Perception Course for Autonomous Driving (docs, ppt and source code) 在线课程主页:http://111.229.117.200:8100/ (作者独立搭建)

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

OmniCorpus

OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

openedai-vision

An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.

License:AGPL-3.0Stargazers:0Issues:0Issues:0

OpenGlass

Turn any glasses into AI-powered smart glasses

License:MITStargazers:0Issues:0Issues:0

SparseDrive

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

License:MITStargazers:0Issues:0Issues:0

SparseOcc

Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 2024)

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

tiny-gpu

A minimal GPU design in Verilog to learn how GPUs work from the ground up

Stargazers:0Issues:0Issues:0

VAD

[ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

License:Apache-2.0Stargazers:0Issues:0Issues:0

ViewFormer-Occ

[ECCV 2024] ViewFormer: Exploring Spatiotemporal Modeling for Multi-View 3D Occupancy Perception via View-Guided Transformers

License:Apache-2.0Stargazers:0Issues:0Issues:0

Vista

A Generalizable World Model for Autonomous Driving

License:Apache-2.0Stargazers:0Issues:0Issues:0