abhi1kumar

Abhinav Kumar | अभिनव कुमार's starred repositories

carla

Open-source simulator for autonomous driving research.

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonApache-2.06042 46 169

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

MIT4791 246 50

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonNOASSERTION4459 52 95

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonApache-2.04115 41 157

advice

A repository of links with advice related to grad school applications, research, phd etc

MIT1608 25 1

Category_Theory_Machine_Learning

List of papers studying machine learning through the lens of category theory

Language:Python1159 650

Birds-eye-view-Perception

[IEEE T-PAMI] Awesome BEV perception research and cookbook for all level audience in autonomous diriving

Language:PythonApache-2.01106 34 18

transfuser

[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving; [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

Language:PythonMIT1019 24 219

Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

MIT757 34 3

StreamPETR

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Language:PythonNOASSERTION497 14 204

BEVerse

The official repository for BEVerse

Language:Python371 32 29

MonoDETR

[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer

Language:Python322 10 62

RoboBEV

RoboBEV: Towards Robust Bird's Eye View Perception under Common Corruption and Domain Shift

Language:Python301 9 9

SparseBEV

[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Language:PythonMIT292 9 72

mseg-api

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"

Language:PythonNOASSERTION245 16 19

DEVIANT

[ECCV 2022] Official PyTorch Code of DEVIANT: Depth Equivariant Network for Monocular 3D Object Detection

Language:C++MIT197 6 29

HoP

[ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

Language:PythonApache-2.0174 8 17

mvdfusion

[CVPR 2024] MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation

Language:PythonMIT89 4 8

groomed_nms

[CVPR 2021] Official PyTorch Code of GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection

Language:C++MIT86 9 4

MonoNeRD

(ICCV2023) MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection

Language:PythonMIT73 7 12

MV2D

Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"

Language:Python71 2 14

SeaBird

[CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects

Language:PythonMIT51 2 3

MERL-RAV_dataset

[CVPR 2020] MERL-RAV Dataset contains over 19k faces annotated with 68 landmarks, with the additional information of whether each landmark is unoccluded, self-occluded or externally occluded.

Language:Python39 3 3

[CVPR 2020] Re-hosting of the LUVLi Face Alignment codebase. Please download the codebase from the original MERL website by agreeing to all terms and conditions. By using this code, you agree to MERL's research-only licensing terms.

34 3 3

WildCamera

Tame a Wild Camera: In-the-Wild Monocular Camera Calibration

Language:ShellApache-2.032 6 1

ViewFool_

This repository contains the ViewFool and ImageNet-V proposed by the paper “ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints” (NeurIPS2022).

Language:Python26 2 3

PolarFormer

[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers

Language:PythonMIT200