Mingkang Xiong (MingkangXiong)

MingkangXiong

Geek Repo

Location:Shanghai, China

Home Page:https://mingkangxiong.github.io

Github PK Tool:Github PK Tool

Mingkang Xiong's starred repositories

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:27969Issues:166Issues:396

nerfstudio

A collaboration friendly studio for NeRFs

Language:PythonLicense:Apache-2.0Stargazers:8966Issues:112Issues:1550

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4222Issues:61Issues:91

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:2818Issues:33Issues:135

2d-gaussian-splatting

[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields

Language:PythonLicense:NOASSERTIONStargazers:1684Issues:40Issues:104

SplaTAM

SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)

Language:PythonLicense:BSD-3-ClauseStargazers:1378Issues:35Issues:110

MonoGS

[CVPR'24 Highlight & Best Demo Award] Gaussian Splatting SLAM

Language:PythonLicense:NOASSERTIONStargazers:1130Issues:14Issues:106

ArgoX

Argo Xray for VPS one-click script. 一键脚本

Awesome-LLM-3D

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

Real-Time-Latent-Consistency-Model

App showcasing multiple real-time diffusion models pipelines with Diffusers

Language:PythonLicense:Apache-2.0Stargazers:848Issues:18Issues:37

Gaussian-SLAM

Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting

Language:PythonLicense:MITStargazers:836Issues:56Issues:24

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

X-StereoLab

SOS IROS 2018 GOOGLE; StereoNet ECCV2018 GOOGLE; ActiveStereoNet ECCV2018 Oral GOOGLE; HITNET CVPR2021 GOOGLE;PLUME Uber ATG

Language:PythonLicense:MITStargazers:678Issues:31Issues:51

DetectorFreeSfM

Code for "Detector-Free Structure from Motion", CVPR 2024

Language:PythonLicense:Apache-2.0Stargazers:541Issues:75Issues:56

CREStereo

Official MegEngine implementation of CREStereo(CVPR 2022 Oral).

Language:PythonLicense:Apache-2.0Stargazers:463Issues:13Issues:56

robocasa

RoboCasa: Large-Scale Simulation of Everyday Tasks for Generalist Robots

Language:PythonLicense:NOASSERTIONStargazers:396Issues:8Issues:23

NeRF-Supervised-Deep-Stereo

A novel paradigm for collecting and generating stereo training data using neural rendering

Language:PythonLicense:MITStargazers:345Issues:16Issues:47

S3Gaussian

Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving

Language:PythonLicense:NOASSERTIONStargazers:334Issues:12Issues:17

vlmaps

[ICRA2023] Implementation of Visual Language Maps for Robot Navigation

Language:PythonLicense:MITStargazers:325Issues:11Issues:51

CREStereo-Pytorch

Non-official Pytorch implementation of the CREStereo(CVPR 2022 Oral).

Awesome-Deep-Stereo-Matching

A curated list of awesome Deep Stereo Matching resources

Language:TeXLicense:MITStargazers:146Issues:6Issues:0

TaPA

[arXiv 2023] Embodied Task Planning with Large Language Models

MoCha-Stereo

[CVPR2024] The official implementation of "MoCha-Stereo: Motif Channel Attention Network for Stereo Matching”.

Language:PythonLicense:MITStargazers:85Issues:10Issues:6
Language:PythonStargazers:69Issues:0Issues:4

Demand-driven-navigation

Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation

Language:PythonLicense:NOASSERTIONStargazers:33Issues:3Issues:0

NaviLLM

[CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'

Language:PythonLicense:MITStargazers:9Issues:0Issues:0
Language:PythonStargazers:8Issues:0Issues:0

ActiveZero

[CVPR 22'] ActiveZero: Mixed Domain Learning for Active Stereovision with Zero Annotation