yoershine's starred repositories

MiniGPT-4

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Language:PythonLicense:BSD-3-ClauseStargazers:25006Issues:219Issues:447

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:13748Issues:114Issues:365

dinov2

PyTorch code and models for the DINOv2 self-supervised learning method.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8044Issues:91Issues:353

Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) for key-frame segmentation and Associating Objects with Transformers (AOT) for efficient tracking and propagation purposes.

Language:Jupyter NotebookLicense:AGPL-3.0Stargazers:2507Issues:50Issues:138

co-tracker

CoTracker is a model for tracking any point (pixel) on a video.

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2455Issues:26Issues:66
Language:PythonLicense:Apache-2.0Stargazers:2051Issues:128Issues:54

Semantic-SAM

Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"

dm-vio

Source code for the paper DM-VIO: Delayed Marginalization Visual-Inertial Odometry

Language:C++License:GPL-3.0Stargazers:931Issues:51Issues:48

mars

MARS: An Instance-aware, Modular and Realistic Simulator for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:639Issues:11Issues:142

street_gaussians

Code for "Street Gaussians for Modeling Dynamic Urban Scenes"

RepViT

RepViT: Revisiting Mobile CNN From ViT Perspective [CVPR 2024] and RepViT-SAM: Towards Real-Time Segmenting Anything

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:595Issues:9Issues:62

neuralsim

neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering.

Language:PythonLicense:MITStargazers:543Issues:42Issues:55

DrivingDiffusion

Layout-Guided multi-view driving scene video generation with latent diffusion model

Language:PythonLicense:MITStargazers:500Issues:18Issues:10

EmerNeRF

PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

Language:PythonLicense:NOASSERTIONStargazers:495Issues:26Issues:24

READ

AAAI2023,implementation of "READ: Large-Scale Neural Scene Rendering for Autonomous Driving", the experimental results are significantly better than Nerf-based methods

Language:PythonLicense:GPL-2.0Stargazers:439Issues:21Issues:65

sphinx-autoapi

A new approach to API documentation in Sphinx.

Language:PythonLicense:MITStargazers:409Issues:14Issues:317

cryptoauthlib

Library for interacting with the Crypto Authentication secure elements

Language:CLicense:NOASSERTIONStargazers:360Issues:41Issues:299

SelfOcc

[CVPR 2024] SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

Language:PythonLicense:Apache-2.0Stargazers:234Issues:15Issues:16

Drive-WM

[CVPR 2024] A world model for autonomous driving.

Language:PythonLicense:Apache-2.0Stargazers:214Issues:20Issues:3

Forge_VFM4AD

A comprehensive survey of forging vision foundation models for autonomous driving, including challenges, methodologies, and opportunities.

NeuRAD

NeuRAD: Neural Rendering for Autonomous Driving

OASim

OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:166Issues:12Issues:9

DrivingGaussian

[CVPR 2024] DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes

WidthFormer

WidthFormer: Toward Efficient Transformer-based BEV View Transformation

Language:PythonLicense:Apache-2.0Stargazers:109Issues:14Issues:8

panacea

[CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"

Language:PythonLicense:Apache-2.0Stargazers:109Issues:13Issues:12

UC-NeRF

[ICLR2024] the official pytorch implementation of UC-NeRF

zod

Software Development Kit for the Zenseact Open Dataset (ZOD)

Language:PythonLicense:MITStargazers:87Issues:6Issues:16

PowerBEV

POWERBEV, a novel and elegant vision-based end-to-end framework that only consists of 2D convolutional layers to perform perception and forecasting of multiple objects in BEVs.

Language:PythonLicense:NOASSERTIONStargazers:79Issues:3Issues:11

MIM4D

MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning