whuhxb

Xiaobing Han's repositories

mars

Mars is a cross-platform network component developed by WeChat.

NOASSERTION000

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

MIT000

MeshXL

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models; 3D generative fundamental models using NeurCF

000

S3Gaussian

Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving

NOASSERTION000

OccSora

OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving

Apache-2.0000

bioclip

This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral].

NOASSERTION000

Co-Occ

[IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Prediction

Apache-2.0000

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Apache-2.0000

CAPEv2

Malware Configuration And Payload Extraction

NOASSERTION000

GaussianFormer

Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

000

PointRWKV

000

MiniCPM-V

MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone

Apache-2.0000

multimodal

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

BSD-3-Clause000

paper-list-added

autoupdate paper list

Apache-2.0000

shine

[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection

NOASSERTION000

SHERT

[CVPR'24 Oral] Official PyTorch implementation for Semantic Human Mesh Reconstruction with Textures.

MIT000

OmDet

Fast and accurate open-vocabulary end-to-end object detection

Apache-2.0000

conv-llava

Apache-2.0000

Anchor3DLane

Official PyTorch implementation for paper`Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection' accepted by CVPR 2023

000

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

NOASSERTION000

Paper-List

A paper list of my history reading. Robotics, Learning, Vision.

000

semantic-gaussians

Official implemetation of the paper "Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting".

MIT000

3DGStream

[CVPR 2024 Highlight] Official repository for the paper "3DGStream: On-the-fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos".

MIT000

detic-sam

Detic + SAM for open-vocabulary object detection and segmentation.

MIT000

PaSCo

[CVPR 2024 Oral - Best paper award candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"

Apache-2.0000

AnimatableGaussians

Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"

NOASSERTION000

PaperReading

Apache-2.0000

ISAT_with_segment_anything

Labeling tool with SAM(segment anything model),supports SAM, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具

NOASSERTION000

odin

Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)

MIT000

vid2avatar

Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)

NOASSERTION000