Beast code in Giters

0.0's starred repositories

CVPR2023-Highlights

CVPR2023 Highlight papers

1000

CheXagent

[Arxiv-2024] CheXagent: Towards a Foundation Model for Chest X-Ray Interpretation

Language:Python9600

Awesome-Vision-Mamba

✨✨Latest Papers on Vision Mamba and Related Areas

5600

ChatGemini

✨ ChatGemini 是一个基于 Google Gemini 的网页客户端，对标 ChatGPT 3.5，操作逻辑同 ChatGPT 3.5 一致，同时支持在聊天中上传图片，应用会自动调用 Gemini-Pro-Vision 模型进行识图。

Language:TypeScriptMIT89800

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language:PythonMIT15700

DoraemonGPT

Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models

BSD-3-Clause5000

Suspicion-Agent

The implementation of "Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4"

Language:Python12400

agentlego

Enhance LLM agents with versatile tool APIs

Language:PythonApache-2.029800

STEVE

⛏💎 STEVE in Minecraft is for See and Think: Embodied Agent in Virtual Environment

MIT2700

awesome-in-context-learning

A curated list of in-context-learning, including classic and up-to-date papers📜

600

clin

Language:JavaScript7100

CaFo

[CVPR 2023] Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners

Language:PythonMIT32800

LLM-in-Vision

Recent LLM-based CV and related works. Welcome to comment/contribute!

74600

LLaVA-Plus-Codebase

LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills

Language:PythonApache-2.064200

PASTA

PASTA: Post-hoc Attention Steering for LLMs

Language:PythonMIT8400

SoM

Set-of-Mark Prompting for LMMs

Language:PythonMIT98700

LEFT

Language:Python3200

LLM-Agent-Paper-Digest

papers related to LLM-agent that published on top conferences

28600

tree-of-thought-llm

[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models

Language:PythonMIT427800

Cola

[NeurIPS2023] Official implementation of the paper "Large Language Models are Visual Reasoning Coordinators"

Language:Jupyter NotebookNOASSERTION9600

VL-PET

[ICCV2023] Official code for "VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control"

Language:PythonMIT4900

InstructCV

[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"

Language:PythonNOASSERTION51400

DetGPT

Language:Jupyter NotebookBSD-3-Clause72600

MixPHM

[CVPR 2023] Pytorch Code of MixPHM: Redundancy-Aware Parameter-Efficient Tuning for Low-Resource Visual Question Answering

Language:PythonMIT1300

MIC

MMICL, a state-of-the-art VLM with the in context learning ability from ICL, PKU

Language:Python30400

Cheetah

Language:PythonBSD-3-Clause32700

TIP

Multimodal-Procedural-Planning

Language:Python8900

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language:PythonApache-2.0151100

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookBSD-3-Clause898600

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language:PythonNOASSERTION401600