Abhinav Kumar | अभिनव कुमार (abhi1kumar)

abhi1kumar

Geek Repo

Company:Michigan State University (MSU)

Location:East Lansing, MI, USA

Home Page:https://sites.google.com/view/abhinavkumar/

Twitter:@abhinav1kumar

Github PK Tool:Github PK Tool


Organizations
facebookresearch
fairinternal

Abhinav Kumar | अभिनव कुमार's starred repositories

carla

Open-source simulator for autonomous driving research.

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:6042Issues:46Issues:169

awesome-3D-gaussian-splatting

Curated list of papers and resources focused on 3D Gaussian Splatting, intended to keep pace with the anticipated surge of research in the coming months.

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonLicense:NOASSERTIONStargazers:4459Issues:52Issues:95

OLMo

Modeling, training, eval, and inference code for OLMo

Language:PythonLicense:Apache-2.0Stargazers:4115Issues:41Issues:157
Language:PythonLicense:Apache-2.0Stargazers:3766Issues:50Issues:104

advice

A repository of links with advice related to grad school applications, research, phd etc

Category_Theory_Machine_Learning

List of papers studying machine learning through the lens of category theory

Language:PythonStargazers:1159Issues:65Issues:0

Birds-eye-view-Perception

[IEEE T-PAMI] Awesome BEV perception research and cookbook for all level audience in autonomous diriving

Language:PythonLicense:Apache-2.0Stargazers:1106Issues:34Issues:18

transfuser

[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving; [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving

Language:PythonLicense:MITStargazers:1019Issues:24Issues:219

Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

StreamPETR

[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection

Language:PythonLicense:NOASSERTIONStargazers:497Issues:14Issues:204

BEVerse

The official repository for BEVerse

MonoDETR

[ICCV 2023] The first DETR model for monocular 3D object detection with depth-guided transformer

RoboBEV

RoboBEV: Towards Robust Bird's Eye View Perception under Common Corruption and Domain Shift

SparseBEV

[ICCV 2023] SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Language:PythonLicense:MITStargazers:292Issues:9Issues:72

mseg-api

An Official Repo of CVPR '20 "MSeg: A Composite Dataset for Multi-Domain Segmentation"

Language:PythonLicense:NOASSERTIONStargazers:245Issues:16Issues:19

DEVIANT

[ECCV 2022] Official PyTorch Code of DEVIANT: Depth Equivariant Network for Monocular 3D Object Detection

Language:C++License:MITStargazers:197Issues:6Issues:29

HoP

[ICCV 2023] Temporal Enhanced Training of Multi-view 3D Object Detector via Historical Object Prediction

Language:PythonLicense:Apache-2.0Stargazers:174Issues:8Issues:17

mvdfusion

[CVPR 2024] MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation

Language:PythonLicense:MITStargazers:89Issues:4Issues:8

groomed_nms

[CVPR 2021] Official PyTorch Code of GrooMeD-NMS: Grouped Mathematically Differentiable NMS for Monocular 3D Object Detection

Language:C++License:MITStargazers:86Issues:9Issues:4

MonoNeRD

(ICCV2023) MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection

Language:PythonLicense:MITStargazers:73Issues:7Issues:12

MV2D

Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"

SeaBird

[CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects

Language:PythonLicense:MITStargazers:51Issues:2Issues:3

MERL-RAV_dataset

[CVPR 2020] MERL-RAV Dataset contains over 19k faces annotated with 68 landmarks, with the additional information of whether each landmark is unoccluded, self-occluded or externally occluded.

Language:PythonLicense:NOASSERTIONStargazers:36Issues:9Issues:4

LUVLi

[CVPR 2020] Re-hosting of the LUVLi Face Alignment codebase. Please download the codebase from the original MERL website by agreeing to all terms and conditions. By using this code, you agree to MERL's research-only licensing terms.

WildCamera

Tame a Wild Camera: In-the-Wild Monocular Camera Calibration

Language:ShellLicense:Apache-2.0Stargazers:32Issues:6Issues:1

ViewFool_

This repository contains the ViewFool and ImageNet-V proposed by the paper “ViewFool: Evaluating the Robustness of Visual Recognition to Adversarial Viewpoints” (NeurIPS2022).

PolarFormer

[AAAI 2023] PolarFormer: Multi-camera 3D Object Detection with Polar Transformers

Language:PythonLicense:MITStargazers:2Issues:0Issues:0