Xiaobing Han's repositories

AI2BMD

AI-powered ab initio biomolecular dynamics simulation

License:MITStargazers:0Issues:0Issues:0

AirSLAM

🚀 AirVO upgrades to AirSLAM 🚀

License:GPL-3.0Stargazers:0Issues:0Issues:0

BiRefNet

[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation

License:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

ComCLIP

Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"

License:MITStargazers:0Issues:0Issues:0

composio

Composio equips agents with well-crafted tools empowering them to tackle complex tasks

License:NOASSERTIONStargazers:0Issues:0Issues:0

ControlNeXt

Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA

License:Apache-2.0Stargazers:0Issues:0Issues:0

crab

CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents. https://crab.camel-ai.org/

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

DeepInteraction

[NeurIPS 2022] DeepInteraction: 3D Object Detection via Modality Interaction

License:MITStargazers:0Issues:0Issues:0

generative-ai-1

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

License:Apache-2.0Stargazers:0Issues:0Issues:0

generative-ai-python

The official Python library for the Google Gemini API

License:Apache-2.0Stargazers:0Issues:0Issues:0

graphrag

A modular graph-based Retrieval-Augmented Generation (RAG) system

License:MITStargazers:0Issues:0Issues:0

HAIR

The Official Implementation for "HAIR: Hypernetworks-based All-in-One Image Restoration".

Stargazers:0Issues:0Issues:0

lazygrounding

[ECCV'24] Official PyTorch implementation of In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation

Stargazers:0Issues:0Issues:0

mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

License:Apache-2.0Stargazers:0Issues:0Issues:0

MetaSeg

MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation (Accepted by WACV 2024)

License:MITStargazers:0Issues:0Issues:0

MuCR

MuCR is a benchmark designed to evaluate Vision Large Language Models' (VLLMs) ability to infer causal relationships using only visual cues

Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

PromptClip

Instantly create video clips from LLM prompts

License:MITStargazers:0Issues:0Issues:0

ProxyCLIP

[ECCV2024] ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation

Stargazers:0Issues:0Issues:0

RISurConv

Official codes for ECCV2024 paper: RISurConv: Rotation Invariant Surface Attention-Augmented Convolutions for 3D Point Cloud Classification and Segmentation

License:MITStargazers:0Issues:0Issues:0

SC4D

[ECCV 2024] Official code for: SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer

Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

SLCA

Codes for ICCV 2023 paper: SLCA: Slow Learner with Classifier Alignment for Continual Learning on a Pre-trained Model

License:MITStargazers:0Issues:0Issues:0

SpatialBot

The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.

License:MITStargazers:0Issues:0Issues:0

unic

PyTorch code and pretrained weights for the UNIC models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

vfusion3d-1

[ECCV 2024] Code for VFusion3D: Learning Scalable 3D Generative Models from Video Diffusion Models

License:NOASSERTIONStargazers:0Issues:0Issues:0

VITA

✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM

Stargazers:0Issues:0Issues:0