Xiaobing Han's repositories
mars
Mars is a cross-platform network component developed by WeChat.
Unique3D
Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image
MeshXL
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models; 3D generative fundamental models using NeurCF
S3Gaussian
Official Implementation of Self-Supervised Street Gaussians for Autonomous Driving
OccSora
OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving
bioclip
This is the repository for the BioCLIP model and the TreeOfLife-10M dataset [CVPR'24 Oral].
Co-Occ
[IEEE RA-L] Co-Occ: Coupling Explicit Feature Fusion with Volume Rendering Regularization for Multi-Modal 3D Semantic Occupancy Prediction
delta
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
CAPEv2
Malware Configuration And Payload Extraction
GaussianFormer
Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction
MiniCPM-V
MiniCPM-Llama3-V 2.5: A GPT-4V Level Multimodal LLM on Your Phone
multimodal
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
paper-list-added
autoupdate paper list
shine
[CVPR'24 Highlight] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
SHERT
[CVPR'24 Oral] Official PyTorch implementation for Semantic Human Mesh Reconstruction with Textures.
OmDet
Fast and accurate open-vocabulary end-to-end object detection
Anchor3DLane
Official PyTorch implementation for paper`Anchor3DLane: Learning to Regress 3D Anchors for Monocular 3D Lane Detection' accepted by CVPR 2023
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
Paper-List
A paper list of my history reading. Robotics, Learning, Vision.
semantic-gaussians
Official implemetation of the paper "Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting".
3DGStream
[CVPR 2024 Highlight] Official repository for the paper "3DGStream: On-the-fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos".
detic-sam
Detic + SAM for open-vocabulary object detection and segmentation.
PaSCo
[CVPR 2024 Oral - Best paper award candidate] Official repository of "PaSCo: Urban 3D Panoptic Scene Completion with Uncertainty Awareness"
AnimatableGaussians
Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
ISAT_with_segment_anything
Labeling tool with SAM(segment anything model),supports SAM, sam-hq, MobileSAM EdgeSAM etc.交互式半自动图像标注工具
odin
Code for the paper: "ODIN: A Single Model for 2D and 3D Segmentation" (CVPR 2024)
vid2avatar
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)