whuhangzhang's starred repositories

MobileAgent

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Language:PythonLicense:MITStargazers:2573Issues:0Issues:0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonLicense:Apache-2.0Stargazers:1713Issues:0Issues:0

GaussianCity

The official implementation of "GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation". (Xie et al., arXiv 2406.06526)

Stargazers:70Issues:0Issues:0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:24693Issues:0Issues:0

VQGAN-pytorch

Pytorch implementation of VQGAN (Taming Transformers for High-Resolution Image Synthesis) (https://arxiv.org/pdf/2012.09841.pdf)

Language:PythonLicense:MITStargazers:419Issues:0Issues:0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:10784Issues:0Issues:0

detrex

detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.

Language:PythonLicense:Apache-2.0Stargazers:1952Issues:0Issues:0

GaussianCube

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

Language:PythonStargazers:280Issues:0Issues:0

Gaussian-SLAM

Gaussian-SLAM: Photo-realistic Dense SLAM with Gaussian Splatting

Language:PythonLicense:MITStargazers:843Issues:0Issues:0

DoGaussian

DoGaussian: Distributed-Oriented Gaussian Splatting for Large-Scale 3D Reconstruction Via Gaussian Consensus

Stargazers:90Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:91Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:90Issues:0Issues:0

MVSGaussian

[ECCV 2024] MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo

Language:PythonLicense:MITStargazers:337Issues:0Issues:0

FastScene

FastScene: Text-Driven Fast 3D Indoor Scene Generation via Panoramic Gaussian Splatting (IJCAI-2024)

Language:PythonStargazers:24Issues:0Issues:0

DSINE

[CVPR 2024 Oral] Rethinking Inductive Biases for Surface Normal Estimation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:658Issues:0Issues:0

dn-splatter

DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing

Language:PythonLicense:Apache-2.0Stargazers:363Issues:0Issues:0

StopThePop

Original reference implementation of "StopThePop: Sorted Gaussian Splatting for View-Consistent Real-time Rendering"

Language:PythonLicense:NOASSERTIONStargazers:133Issues:0Issues:0

street-gaussians-ns

Unofficial implementation of "Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting", ECCV2024.

Language:PythonLicense:Apache-2.0Stargazers:280Issues:0Issues:0

Lightweight-Deformable-GS

[Just 4 Fun] Deformable-GS with less storage, 150+FPS, and sota quality.

Language:PythonLicense:MITStargazers:21Issues:0Issues:0

DetectorFreeSfM

Code for "Detector-Free Structure from Motion", CVPR 2024

Language:PythonLicense:Apache-2.0Stargazers:563Issues:0Issues:0

vggsfm

VGGSfM: Visual Geometry Grounded Deep Structure From Motion

Language:PythonLicense:NOASSERTIONStargazers:782Issues:0Issues:0

flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Language:PythonLicense:MITStargazers:848Issues:0Issues:0

4DGaussians

[CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:2014Issues:0Issues:0

VastGaussian

This is an unofficial Implementation

Language:C++License:Apache-2.0Stargazers:297Issues:0Issues:0

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language:PythonLicense:Apache-2.0Stargazers:680Issues:0Issues:0

SatforHDMap

The implementation of our ICRA2024 submission manuscript paper "Complementing Onboard Sensors with Satellite Map: A New Perspective for HD Map Construction"

Language:PythonLicense:GPL-3.0Stargazers:36Issues:0Issues:0

bevfusion

[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Language:PythonLicense:Apache-2.0Stargazers:2193Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonLicense:MITStargazers:31235Issues:0Issues:0

odw-2024

Materials from GW Open Data Workshop, 2024

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:17Issues:0Issues:0