Xu Luo's starred repositories
DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
FSQ-pytorch
A PyTorch implementation of Finite Scalar Quantization
Gemini-Commonsense-Evaluation
Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"
Large-Language-Models-play-StarCraftII
TextStarCraft2, a pure-language environment that lets LLMs play StarCraft II
planning-as-inpainting
Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty
GPT-4V-Act
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
Vision-AGI-Survey
A temporary webpage for our survey on AGI for computer vision
LabelHalluc
[AAAI 2022] Label Hallucination for Few-Shot Classification