Xiaobing Han's repositories

apollo

An open autonomous driving platform

License:Apache-2.0Stargazers:0Issues:0Issues:0

autoware

Autoware - the world's leading open-source software project for autonomous driving

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

BEV-Perception

Bird's Eye View Perception

License:MITStargazers:0Issues:0Issues:0

BEVFormer

[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

CityGaussian

Repository for CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians

Stargazers:0Issues:0Issues:0

corenet

CoreNet: A library for training deep neural networks

License:NOASSERTIONStargazers:0Issues:0Issues:0

D-iGPT

[ICML 2024] This repository includes the official implementation of our paper "Rejuvenating image-GPT as Strong Visual Representation Learners"

Stargazers:0Issues:0Issues:0
License:EPL-2.0Stargazers:0Issues:0Issues:0
License:NOASSERTIONStargazers:0Issues:0Issues:0

Groma

Grounded Multimodal Large Language Model with Localized Visual Tokenization

License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

Infusion

Official implementations for paper: InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

License:MITStargazers:0Issues:0Issues:0

interactive3d

[CVPR'24] Interactive3D: Create What You Want by Interactive 3D Generation

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

mamba360

State Space Models

Stargazers:0Issues:0Issues:0

Mask_RCNN

Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

License:NOASSERTIONStargazers:0Issues:0Issues:0

MCTF

Official implementation of CVPR 2024 paper "Multi-criteria Token Fusion with One-step-ahead Attention for Efficient Vision Transformers".

License:MITStargazers:0Issues:0Issues:0

Metric3D

The repo for "Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image" and "Metric3Dv2: A Versatile Monocular Geometric Foundation Model..."

License:CC0-1.0Stargazers:0Issues:0Issues:0

MicroDreamer

Official implementation of "MicroDreamer: Zero-shot 3D Generation in ~20 Seconds by Score-based Iterative Reconstruction".

License:Apache-2.0Stargazers:0Issues:0Issues:0

MiniGPT-3D

MiniGPT-3D: Efficiently Aligning 3D Point Clouds with Large Language Models using 2D Priors(Under Review)

Stargazers:0Issues:0Issues:0

mllm

Fast Multimodal LLM on Mobile Devices

License:MITStargazers:0Issues:0Issues:0

MLLM-Bench

MLLM-Bench: Evaluating Multimodal LLMs with Per-sample Criteria

Stargazers:0Issues:0Issues:0

mvs_objaverse

A little repo to render objaverse objects with blender

License:MITStargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0

PaddleMIX

Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.

License:Apache-2.0Stargazers:0Issues:0Issues:0

RSCaMa

RSCaMa: Remote Sensing Image Change Captioning with State Space Model

Stargazers:0Issues:0Issues:0

VMamba

VMamba: Visual State Space Models,code is based on mamba

Stargazers:0Issues:0Issues:0

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning large models (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

License:Apache-2.0Stargazers:0Issues:0Issues:0