0.0's starred repositories
CVPR2023-Highlights
CVPR2023 Highlight papers
Awesome-Vision-Mamba
✨✨Latest Papers on Vision Mamba and Related Areas
ChatGemini
✨ ChatGemini 是一个基于 Google Gemini 的网页客户端,对标 ChatGPT 3.5,操作逻辑同 ChatGPT 3.5 一致,同时支持在聊天中上传图片,应用会自动调用 Gemini-Pro-Vision 模型进行识图。
visualwebarena
VisualWebArena is a benchmark for multimodal agents.
DoraemonGPT
Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models
Suspicion-Agent
The implementation of "Suspicion-Agent: Playing Imperfect Information Games with Theory of Mind Aware GPT-4"
awesome-in-context-learning
A curated list of in-context-learning, including classic and up-to-date papers📜
LLM-in-Vision
Recent LLM-based CV and related works. Welcome to comment/contribute!
LLaVA-Plus-Codebase
LLaVA-Plus: Large Language and Vision Assistants that Plug and Learn to Use Skills
LLM-Agent-Paper-Digest
papers related to LLM-agent that published on top conferences
tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
InstructCV
[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"