Jiajun Deng (djiajunustc)

djiajunustc

Geek Repo

Company:University of Adelaide

Location:Adelaide

Github PK Tool:Github PK Tool

Jiajun Deng's starred repositories

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonLicense:NOASSERTIONStargazers:4756Issues:54Issues:124

lerobot

šŸ¤— LeRobot: End-to-end Learning for Real-World Robotics in Pytorch

Language:PythonLicense:Apache-2.0Stargazers:4538Issues:51Issues:68

Segment-Everything-Everywhere-All-At-Once

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Language:PythonLicense:Apache-2.0Stargazers:4226Issues:58Issues:143

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:PythonLicense:GPL-3.0Stargazers:4043Issues:37Issues:389
Language:PythonLicense:NOASSERTIONStargazers:1846Issues:93Issues:37

SplaTAM

SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)

Language:PythonLicense:BSD-3-ClauseStargazers:1382Issues:36Issues:117

FeatUp

Official code for "FeatUp: A Model-Agnostic Frameworkfor Features at Any Resolution" ICLR 2024

Language:Jupyter NotebookLicense:MITStargazers:1303Issues:19Issues:57

MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Language:PythonLicense:NOASSERTIONStargazers:1139Issues:16Issues:113

Video-ChatGPT

[ACL 2024 šŸ”„] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:1082Issues:13Issues:112

home-robot

Mobile manipulation research tools for roboticists

Language:PythonLicense:MITStargazers:827Issues:31Issues:155

Deformable-3D-Gaussians

[CVPR 2024] Official implementation of "Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction"

Language:PythonLicense:MITStargazers:800Issues:14Issues:68

Neural-Network-Parameter-Diffusion

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

openvla

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Language:PythonLicense:MITStargazers:717Issues:12Issues:23

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:640Issues:11Issues:29

ManiSkill

SAPIEN Manipulation Skill Framework, a GPU parallelized robotics simulator and benchmark

Language:PythonLicense:Apache-2.0Stargazers:599Issues:17Issues:191

EmerNeRF

PyTorch Implementation of EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision

Language:PythonLicense:NOASSERTIONStargazers:527Issues:27Issues:26

PonderV2

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Language:PythonLicense:MITStargazers:309Issues:20Issues:20

video2game

Code release of Video2Game

Language:JavaScriptLicense:MITStargazers:290Issues:11Issues:3

GiT

šŸ”„ [ECCV2024] Official Implementation of "GiT: Towards Generalist Vision Transformer through Universal Language Interface"

Language:PythonLicense:Apache-2.0Stargazers:256Issues:6Issues:8

PLA

(CVPR 2023) PLA: Language-Driven Open-Vocabulary 3D Scene Understanding & (CVPR2024) RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding

Language:PythonLicense:Apache-2.0Stargazers:241Issues:13Issues:49

Agent-Driver

A Language Agent for Autonomous Driving

Language:PythonLicense:MITStargazers:194Issues:13Issues:9

open-eqa

OpenEQA Embodied Question Answering in the Era of Foundation Models

Language:Jupyter NotebookLicense:MITStargazers:191Issues:13Issues:8

GeMap

[ECCV'24] Online Vectorized HD Map Construction using Geometry

Language:PythonLicense:Apache-2.0Stargazers:171Issues:8Issues:12

BunnyVisionPro

Bimanual Dexterous Teleoperation with Real-Time Retargeting using VisionPro

Language:PythonLicense:MITStargazers:153Issues:5Issues:2

L3MVN

Leveraging Large Language Models for Visual Target Navigation

spoc-robot-training

SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World

Language:PythonLicense:NOASSERTIONStargazers:61Issues:6Issues:7
Language:PythonStargazers:26Issues:0Issues:0

DeepEraser

The official code for ā€œDeepEraser: Deep Iterative Context Mining for Generic Text Eraserā€.