ZhongdaoWang (Zhongdao)

Zhongdao

Geek Repo

Company:Tsinghua University

Location:Beijing, China

Home Page:https://zhongdao.github.io

Github PK Tool:Github PK Tool

ZhongdaoWang's starred repositories

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:15974Issues:152Issues:1241

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:4945Issues:63Issues:360

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonLicense:MITStargazers:3671Issues:83Issues:77

mobile-aloha

Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation

Language:Jupyter NotebookLicense:MITStargazers:3339Issues:70Issues:8

vector-quantize-pytorch

Vector (and Scalar) Quantization, in Pytorch

Language:PythonLicense:MITStargazers:1860Issues:26Issues:89

4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1646Issues:34Issues:96

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1441Issues:8Issues:113

DeepSeek-LLM

DeepSeek LLM: Let there be answers

Language:MakefileLicense:MITStargazers:1115Issues:19Issues:31

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:843Issues:75Issues:19

zipnerf-pytorch

Unofficial implementation of ZipNeRF

Language:PythonLicense:Apache-2.0Stargazers:747Issues:16Issues:96

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language:PythonLicense:MITStargazers:733Issues:16Issues:43

OMG-Seg

[CVPR-2024] One Model For Image/Video/Instractive/Open-Vocabulary Segmentation

Language:PythonLicense:NOASSERTIONStargazers:681Issues:7Issues:2

EdgeSAM

Official PyTorch implementation of "EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM"

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:675Issues:14Issues:20

DriveLM

DriveLM: Driving with Graph Visual Question Answering

Language:HTMLLicense:Apache-2.0Stargazers:619Issues:25Issues:29

Awesome-LLM4AD

A curated list of awesome LLM for Autonomous Driving resources (continually updated)

LLaMA-VID

Official Implementation for LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:567Issues:9Issues:79

Vary-toy

Official code implementation of Vary-toy (Small Language Model Meets with Reinforced Vision Vocabulary)

neuralsim

neuralsim: 3D surface reconstruction and simulation based on 3D neural rendering.

Language:PythonLicense:MITStargazers:522Issues:41Issues:51

PointTransformerV3

[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)

Language:PythonLicense:MITStargazers:427Issues:12Issues:18

magvit2-pytorch

Implementation of MagViT2 Tokenizer in Pytorch

Language:PythonLicense:MITStargazers:405Issues:28Issues:29

LangSplat

Official implementation of the paper "LangSplat: 3D Language Gaussian Splatting" [CVPR2024 Highlight]

Language:PythonLicense:NOASSERTIONStargazers:388Issues:18Issues:17

MaskDiT

Code for Fast Training of Diffusion Models with Masked Transformers

Language:PythonLicense:MITStargazers:277Issues:14Issues:9

OccWorld

3D World Model for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:257Issues:9Issues:20

SelfOcc

[CVPR 2024] SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction

Language:PythonLicense:Apache-2.0Stargazers:217Issues:14Issues:14

Denoising-ViT

This is the official code release for our work, Denoising Vision Transformers.

Language:PythonLicense:MITStargazers:191Issues:14Issues:6

UC-NeRF

the official pytorch implementation of UC-NeRF

Uni3DETR

Code release for our NeurIPS 2023 paper "Uni3DETR: Unified 3D Detection Transformer".

Language:PythonLicense:Apache-2.0Stargazers:58Issues:4Issues:6