Xu Luo (Frankluox)

Frankluox

Geek Repo

Company:UESTC

Location:Chengdu, China

Home Page:https://frankluox.github.io/

Twitter:@lux77215832

Github PK Tool:Github PK Tool

Xu Luo's starred repositories

DoraemonGPT

Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:56Issues:0Issues:0

FSQ-pytorch

A Pytorch Implementation of Finite Scalar Quantization

Language:PythonStargazers:56Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:58Issues:0Issues:0

RoboGen

A generative and self-guided robotic agent that endlessly propose and master new skills.

Language:PythonLicense:Apache-2.0Stargazers:503Issues:0Issues:0

LLaMA-Pro

[ACL 2024] Progressive LLaMA with Block Expansion.

Language:PythonLicense:Apache-2.0Stargazers:431Issues:0Issues:0

SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Language:PythonLicense:NOASSERTIONStargazers:523Issues:0Issues:0
Language:PythonStargazers:69Issues:0Issues:0

Gemini-Commonsense-Evaluation

Official implementation of "Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models"

License:MITStargazers:35Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:528Issues:0Issues:0

MoTCoder

This is the official code repository of MoTCoder: Elevating Large Language Models with Modular of Thought for Challenging Programming Tasks.

Language:PythonStargazers:55Issues:0Issues:0

vstar

PyTorch Implementation of "V* : Guided Visual Search as a Core Mechanism in Multimodal LLMs"

Language:PythonLicense:MITStargazers:462Issues:0Issues:0

Large-Language-Models-play-StarCraftII

TextStarCraft2,a pure language env which support llms play starcraft2

Language:PythonStargazers:172Issues:0Issues:0

CHOCOLATE

Code and data for the paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:23Issues:0Issues:0

Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Language:PythonLicense:Apache-2.0Stargazers:711Issues:0Issues:0

MP5

[CVPR2024] This is the official implement of MP5

Language:PythonStargazers:66Issues:0Issues:0
Language:PythonStargazers:36Issues:0Issues:0

planning-as-inpainting

Planning as In-Painting: A Diffusion-Based Embodied Task Planning Framework for Environments under Uncertainty

Language:PythonStargazers:14Issues:0Issues:0

CM3Leon

An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images

Language:PythonLicense:MITStargazers:341Issues:0Issues:0

magicoder

Magicoder: Source Code Is All You Need

Language:PythonLicense:MITStargazers:1923Issues:0Issues:0

IMProv

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Language:PythonStargazers:57Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1671Issues:0Issues:0

PCA-EVAL

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

Language:Jupyter NotebookStargazers:96Issues:0Issues:0

GPT-4V-Act

AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI

Language:JavaScriptStargazers:911Issues:0Issues:0
Language:PythonStargazers:31Issues:0Issues:0
Language:PythonLicense:MITStargazers:11Issues:0Issues:0

LoRA-ViT

Low rank adaptation for Vision Transformer

Language:PythonLicense:GPL-3.0Stargazers:310Issues:0Issues:0

FedNoisy

Benchmark for federated noisy label learning

Language:PythonLicense:Apache-2.0Stargazers:16Issues:0Issues:0

multimodal-prompt-learning

[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".

Language:PythonLicense:MITStargazers:562Issues:0Issues:0

Vision-AGI-Survey

A temporary webpage for our survey in AGI for computer vision

Stargazers:115Issues:0Issues:0

LabelHalluc

[AAAI 2022] Label Hallucination for Few-Shot Classification

Language:PythonStargazers:25Issues:0Issues:0