whuhangzhang's starred repositories

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:28293Issues:187Issues:916

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:23828Issues:193Issues:3745

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:7729Issues:74Issues:262

bevfusion

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Language:PythonLicense:Apache-2.0Stargazers:2119Issues:42Issues:603

detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:1892Issues:26Issues:155

4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:1882Issues:39Issues:126

Gaussian-SLAM

Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting

Language:PythonLicense:MITStargazers:815Issues:56Issues:24

flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Language:PythonLicense:MITStargazers:812Issues:19Issues:39

DSINE

[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:620Issues:9Issues:7

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:581Issues:11Issues:26

DetectorFreeSfM

Code for "Detector-Free Structure from Motion", CVPR 2024

Language:PythonLicense:Apache-2.0Stargazers:522Issues:75Issues:47

vggsfm

[CVPR 2024 Highlight] VGGSfM Visual Geometry Grounded Deep Structure From Motion

Language:PythonLicense:NOASSERTIONStargazers:445Issues:27Issues:15

gaussian_surfels

Implementation of the SIGGRAPH 2024 conference paper "High-quality Surface Reconstruction using Gaussian Surfels".

VQGAN-pytorch

Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)

Language:PythonLicense:MITStargazers:389Issues:3Issues:16

dn-splatter

DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing

Language:PythonLicense:Apache-2.0Stargazers:304Issues:15Issues:31

GaussianCube

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

Language:C++License:Apache-2.0Stargazers:242Issues:11Issues:14

street-gaussians-ns

Unofficial implementation of "Street Gaussians for Modeling Dynamic Urban Scenes"

Language:PythonLicense:Apache-2.0Stargazers:200Issues:10Issues:22

DrivingGaussian

[CVPR 2024] DrivingGaussian: Composite Gaussian Splatting for Surrounding Dynamic Autonomous Driving Scenes

MVSGaussian

MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

StopThePop

Original reference implementation of "StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time Rendering"

Language:PythonLicense:NOASSERTIONStargazers:96Issues:5Issues:5

sam_road

Segment Anything Model for large-scale, vectorized road network extraction from aerial imagery. CVPRW 2024

Language:PythonLicense:MITStargazers:93Issues:2Issues:21
Language:PythonLicense:Apache-2.0Stargazers:82Issues:3Issues:6
Language:PythonLicense:NOASSERTIONStargazers:75Issues:10Issues:11

DoGaussian

DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus

SatforHDMap

The implementation of our ICRA2024 submission manuscript paper "Complementing Onboard Sensors with Satellite Map: A New Perspective for HD Map Construction"

Language:PythonLicense:GPL-3.0Stargazers:33Issues:0Issues:0

odw-2024

Materials from GW Open Data Workshop, 2024

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:15Issues:0Issues:0

Lightweight-Deformable-GS

[Just 4 Fun] Deformable-GS with less storage, 150+FPS, and sota quality.

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

FastScene

FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting (IJCAI-2024)

Stargazers:4Issues:0Issues:0