Xiaobing Han's repositories
GaussianShader
code for GaussianShader: 3D Gaussian Splatting with Shading Functions for Reflective Surfaces
RALF
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
embodied-generalist
[ICML 2024] Official code repository for 3D embodied generalist agent LEO
RoadBEV
Codes for RoadBEV: road surface reconstruction in Bird's Eye View
X3D-Edit
X3D-Edit is an Extensible 3D (X3D) Graphics authoring tool for simple error-free creation, editing, validation and viewing of X3D scenes for interactive Web-based visualization. X3D-Edit runs as a standalone application or Netbeans plugin. The X3D file format is an advanced XML version of the original VRML97 international standard.
Qwen-VL
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
sigllm
LLMs for sintel
PaliGemma-FineTuning
PaliGemma FineTuning
T-Rex
API for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
cape
Computational Aerosciences Productivity & Execution
ChatSim
[CVPR2024 Highlight] Editable Scene Simulation for Autonomous Driving via LLM-Agent Collaboration
GALA3D
[ICML 2024] GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting
SimDistill
The official repo for [AAAI 2024] "SimDistill: Simulated Multi-modal Distillation for BEV 3D Object Detection""
papper-3D-detection
papper,overview,
emoca
Official repository accompanying a CVPR 2022 paper EMOCA: Emotion Driven Monocular Face Capture And Animation. EMOCA takes a single image of a face as input and produces a 3D reconstruction. EMOCA sets the new standard on reconstructing highly emotional images in-the-wild
DriveLM
DriveLM: Driving with Graph Visual Question Answering
GenAD
GenAD: Generative End-to-End Autonomous Driving
tutel
Tutel MoE: An Optimized Mixture-of-Experts Implementation
Awesome-Talking-Head-Synthesis
💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩
transfuser
[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving; [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
gauzilla
Gauzilla: a 3D Gaussian Splatting renderer written in Rust for WebAssembly with lock-free multithreading
OverlapMamba
OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition
awesome-3d-diffusion
A collection of papers on diffusion models for 3D generation.
foundry
Foundry is a blazing fast, portable and modular toolkit for Ethereum application development written in Rust.
tutorials
MONAI Tutorials
Cam4DOcc
[CVPR 2024] Cam4DOcc: Benchmark for Camera-Only 4D Occupancy Forecasting in Autonomous Driving Applications