Siyan Dong (siyandong)

siyandong

Geek Repo

Company:Shandong University

Home Page:https://siyandong.github.io/

Github PK Tool:Github PK Tool

Siyan Dong's starred repositories

Language:MATLABLicense:GPL-3.0Stargazers:10233Issues:0Issues:0

PyTorch-VAE

A Collection of Variational Autoencoders (VAE) in PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:6312Issues:0Issues:0

Mono3D

Source code for "Mononizing Binocular Videos" (SIGGRAPH Asia 2020)

Language:PythonLicense:NOASSERTIONStargazers:39Issues:0Issues:0

multi-scene-pose-transformer

Multi-Scene Camera Pose Regression with Transformers

Language:PythonStargazers:50Issues:0Issues:0

Wonder3D

Single Image to 3D using Cross-Domain Diffusion for 3D Generation

Language:PythonLicense:AGPL-3.0Stargazers:4568Issues:0Issues:0

CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Language:Jupyter NotebookLicense:MITStargazers:23869Issues:0Issues:0

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:54678Issues:0Issues:0

LightGlue

LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

Language:PythonLicense:Apache-2.0Stargazers:3198Issues:0Issues:0

FoundationPose

[CVPR 2024 Highlight] FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects

Language:PythonLicense:NOASSERTIONStargazers:1213Issues:0Issues:0

MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Language:PythonLicense:NOASSERTIONStargazers:1145Issues:0Issues:0

dust3r

DUSt3R: Geometric 3D Vision Made Easy

Language:PythonLicense:NOASSERTIONStargazers:4770Issues:0Issues:0

3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Language:PythonLicense:MITStargazers:864Issues:0Issues:0

NeuS

Code release for NeuS

Language:PythonLicense:MITStargazers:1530Issues:0Issues:0

nerf-pytorch

A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.

Language:PythonLicense:MITStargazers:5273Issues:0Issues:0
Language:PythonLicense:MITStargazers:4064Issues:0Issues:0

objaverse-xl

🪐 Objaverse-XL is a Universe of 10M+ 3D Objects. Contains API Scripts for Downloading and Processing!

Language:PythonLicense:Apache-2.0Stargazers:660Issues:0Issues:0

Depth-Anything

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Language:PythonLicense:Apache-2.0Stargazers:6534Issues:0Issues:0

generative-models

Generative Models by Stability AI

Language:PythonLicense:MITStargazers:23536Issues:0Issues:0

gim

GIM: Learning Generalizable Image Matcher From Internet Videos (ICLR 2024 Spotlight)

Language:PythonLicense:MITStargazers:326Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5773Issues:0Issues:0

SyncDreamer

[ICLR 2024 Spotlight] SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

Language:PythonLicense:MITStargazers:847Issues:0Issues:0
Language:PythonStargazers:48Issues:0Issues:0

BlendedMVS

BlendedMVS: A Large-scale Dataset for Generalized Multi-view Stereo Networks

Stargazers:534Issues:0Issues:0

PoseLib

Minimal solvers for calibrated camera pose estimation

Language:C++License:BSD-3-ClauseStargazers:812Issues:0Issues:0

Uni3D

[ICLR'24 Spotlight] Uni3D: 3D Visual Representation from BAAI

Language:PythonLicense:MITStargazers:437Issues:0Issues:0

dot

Dense Optical Tracking: Connecting the Dots

Language:PythonLicense:MITStargazers:219Issues:0Issues:0

ace

[CVPR 2023 - Highlight] Accelerated Coordinate Encoding (ACE): Learning to Relocalize in Minutes using RGB and Poses

Language:PythonLicense:NOASSERTIONStargazers:339Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1713Issues:0Issues:0

instant-nsr-pl

Neural Surface reconstruction based on Instant-NGP. Efficient and customizable boilerplate for your research projects. Train NeuS in 10min!

Language:PythonLicense:MITStargazers:822Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18383Issues:0Issues:0