sml8648 / LLM-Agent-Survey


A Survey on LLM-based Autonomous Agents

Autonomous artificial intelligence (AI) agents are designed to accomplish specific objectives through self-guided instructions, automatic memorization, planning, and action. With the advent and rapid progress of large language models (LLMs), researchers increasingly use LLMs as the core controllers of these autonomous agents. However, a unified view that ties together the diverse studies in this field has been lacking. This repository houses a comprehensive and systematic survey that fills this gap, focusing on the construction, applications, and evaluation of LLM-based autonomous AI agents.

In particular, we explore the essential components of an AI agent: a profile module, a memory module, a planning module, and an action module. We further investigate potential applications in the natural and social sciences and introduce methods for evaluating agent effectiveness. We also discuss the challenges and future directions of this field.
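
One way to picture how the four modules named above fit together is the minimal Python sketch below. It is an illustration under stated assumptions, not the survey's reference design: the class names, the keyword-overlap memory lookup, the `tool: argument` plan format, and the stub `fake_llm` are all hypothetical stand-ins, and a real agent would call an actual LLM in place of the stub.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Profile:
    """Profile module: who the agent is (injected into every prompt)."""
    name: str
    role: str


@dataclass
class Memory:
    """Memory module: append-only store with a naive keyword-overlap read."""
    records: List[str] = field(default_factory=list)

    def write(self, item: str) -> None:
        self.records.append(item)

    def read(self, query: str, k: int = 3) -> List[str]:
        words = set(query.lower().split())
        scored = [(len(words & set(r.lower().split())), r) for r in self.records]
        ranked = sorted(scored, key=lambda pair: -pair[0])
        return [r for score, r in ranked if score > 0][:k]


class Agent:
    def __init__(self, profile: Profile, llm: Callable[[str], str],
                 tools: Dict[str, Callable[[str], str]]) -> None:
        self.profile = profile
        self.memory = Memory()
        self.llm = llm      # assumption: any text-in, text-out callable
        self.tools = tools

    def plan(self, goal: str) -> List[str]:
        """Planning module: ask the LLM for 'tool: argument' steps."""
        known = "; ".join(self.memory.read(goal))
        prompt = (f"You are {self.profile.name}, {self.profile.role}. "
                  f"Goal: {goal}. Known: {known}. "
                  "Reply with one 'tool: argument' step per line.")
        return [ln.strip() for ln in self.llm(prompt).splitlines() if ln.strip()]

    def act(self, step: str) -> str:
        """Action module: run the named tool and record the outcome."""
        tool_name, _, argument = step.partition(":")
        tool = self.tools.get(tool_name.strip())
        if tool is None:
            result = f"unknown tool {tool_name.strip()!r}"
        else:
            result = tool(argument.strip())
        self.memory.write(f"{step} -> {result}")
        return result

    def run(self, goal: str) -> List[str]:
        return [self.act(step) for step in self.plan(goal)]


def fake_llm(prompt: str) -> str:
    """Hypothetical stand-in for an LLM call; always returns the same plan."""
    return "upper: hello\nlength: hello"


agent = Agent(
    Profile(name="Demo", role="a text utility assistant"),
    llm=fake_llm,
    tools={"upper": str.upper, "length": lambda s: str(len(s))},
)
results = agent.run("shout and measure a word")
print(results)
```

Keeping the LLM behind a plain callable means the same skeleton can be exercised with a stub, as here, or wired to a real model without touching the modules.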

Our paper and this repository aim to serve as a resource for researchers and practitioners alike, providing insights, related references, and continuous updates on this exciting and rapidly evolving field.

Growth Trend of LLM-based Autonomous Agent

[Figure: Growth Trend]


Structure of the Survey

[Figure: Structure]

Construction of LLM-based Autonomous Agent

Architecture Design

| Model | Profile | Memory Operation | Memory Structure | Planning | Action | LS | Time | Paper | Code |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| WebGPT | - | - | - | - | w/ tools | Human feedback | 12/2021 | Paper | - |
| SayCan | - | - | - | w/o feedback | w/ tools | Environment feedback | 04/2022 | Paper | Code |
| MRKL | - | - | - | w/o feedback | w/ tools | - | 05/2022 | Paper | - |
| Inner Monologue | - | - | - | w/ feedback | w/ tools | Environment feedback | 07/2022 | Paper | Code |
| Social Simulacra | GPT-Generated | - | - | - | w/o tools | - | 08/2022 | Paper | - |
| ReAct | - | - | - | w/ feedback | w/ tools | Environment feedback | 10/2022 | Paper | Code |
| REPLUG | - | Read/Write/Reflection | Unified | - | w/o tools | - | 01/2023 | Paper | - |
| DEPS | - | - | - | w/ feedback | w/ tools | Environment feedback | 02/2023 | Paper | - |
| Toolformer | - | - | - | w/o feedback | w/ tools | Example | 02/2023 | Paper | Code |
| Reflexion | - | Read/Write/Reflection | Hybrid | w/ feedback | w/o tools | Environment feedback | 03/2023 | Paper | Code |
| CAMEL | Handcrafting & GPT-Generated | - | - | w/ feedback | w/o tools | - | 03/2023 | Paper | Code |
| API-Bank | - | - | - | w/ feedback | w/ tools | Example | 04/2023 | Paper | Code |
| ViperGPT | - | - | - | - | w/ tools | - | 03/2023 | Paper | Code |
| HuggingGPT | - | Read/Write | Unified | w/o feedback | w/ tools | - | 03/2023 | Paper | Code |
| Generative Agents | Handcrafting | Read/Write/Reflection | Hybrid | w/o feedback | w/o tools | - | 04/2023 | Paper | Code |
| LLM+P | - | - | - | w/o feedback | w/ tools | - | 04/2023 | Paper | - |
| ChemCrow | - | - | - | w/ feedback | w/ tools | - | 04/2023 | Paper | Code |
| OpenAGI | - | - | - | w/ feedback | w/ tools | Example | 04/2023 | - | Code |
| AutoGPT | - | Read/Write | Hybrid | w/ feedback | w/ tools | Environment feedback | 04/2023 | - | Code |
| SCM | - | Read/Write | Hybrid | - | w/o tools | - | 04/2023 | Paper | Code |
| Socially Alignment | - | Read/Write | Hybrid | - | w/o tools | Example | 05/2023 | Paper | Code |
| GITM | - | Read/Write/Reflection | Hybrid | w/ feedback | w/o tools | Environment feedback | 05/2023 | Paper | Code |
| Voyager | - | Read/Write/Reflection | Unified | w/ feedback | w/o tools | Environment feedback | 05/2023 | Paper | Code |
| Introspective Tips | - | Read/Write/Reflection | Unified | w/ feedback | w/o tools | Example & Environment feedback | 05/2023 | Paper | - |
| RET-LLM | - | Read/Write/Reflection | Unified | - | w/o tools | Example | 05/2023 | Paper | - |
| ChatDB | - | Read/Write/Reflection | Unified | w/ feedback | w/ tools | - | 06/2023 | Paper | - |
| S3 | Dataset alignment | Read/Write/Reflection | Hybrid | w/o feedback | w/o tools | - | 07/2023 | Paper | - |
| ChatDev | Handcrafting | Read/Write/Reflection | Unified | w/ feedback | w/o tools | - | 07/2023 | Paper | Code |
| ToolLLM | - | - | - | w/ feedback | w/ tools | Example | 07/2023 | Paper | Code |
| MemoryBank | - | Read/Write/Reflection | Hybrid | - | w/o tools | - | 07/2023 | Paper | Code |
| MetaGPT | Handcrafting | Read/Write/Reflection | Hybrid | w/ feedback | w/ tools | - | 08/2023 | Paper | Code |

This table summarizes only part of the previous work. See the More Comprehensive Summarization section for more papers.
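
The columns of the architecture table can be read as a small schema, which makes the taxonomy easy to query. The sketch below is only an illustration: the `AgentEntry` dataclass, its field names, and the `select` helper are hypothetical, the five rows are copied from the table above, and `None` stands for a "-" cell.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class AgentEntry:
    """One row of the architecture table; None mirrors a '-' cell."""
    model: str
    profile: Optional[str]
    memory_operation: Optional[str]
    memory_structure: Optional[str]
    planning: Optional[str]
    action: Optional[str]
    ls: Optional[str]   # the table's "LS" column, kept under its original label
    time: str


# A handful of rows transcribed from the table above.
ENTRIES: List[AgentEntry] = [
    AgentEntry("ReAct", None, None, None,
               "w/ feedback", "w/ tools", "Environment feedback", "10/2022"),
    AgentEntry("HuggingGPT", None, "Read/Write", "Unified",
               "w/o feedback", "w/ tools", None, "03/2023"),
    AgentEntry("Reflexion", None, "Read/Write/Reflection", "Hybrid",
               "w/ feedback", "w/o tools", "Environment feedback", "03/2023"),
    AgentEntry("Generative Agents", "Handcrafting", "Read/Write/Reflection", "Hybrid",
               "w/o feedback", "w/o tools", None, "04/2023"),
    AgentEntry("Voyager", None, "Read/Write/Reflection", "Unified",
               "w/ feedback", "w/o tools", "Environment feedback", "05/2023"),
]


def select(entries: List[AgentEntry], *, reflection: bool = False,
           feedback: bool = False) -> List[str]:
    """Return model names matching the requested memory/planning traits."""
    names = []
    for entry in entries:
        if reflection and "Reflection" not in (entry.memory_operation or ""):
            continue
        if feedback and entry.planning != "w/ feedback":
            continue
        names.append(entry.model)
    return names


# Agents whose memory supports reflection and whose planning uses feedback.
print(select(ENTRIES, reflection=True, feedback=True))
```

The same pattern extends to the other columns, for example filtering on `action == "w/ tools"`.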

Applications of LLM-based Autonomous Agent

| Title | Social Science | Natural Science | Engineering | Time | Paper | Code |
| --- | --- | --- | --- | --- | --- | --- |
| SayCan | - | - | Robotics & Embodied AI | 04/2022 | Paper | Code |
| Social Simulacra | Social Simulation | - | - | 08/2022 | Paper | - |
| TE | Psychology | - | - | 08/2022 | Paper | Code |
| Out of One | Political Science and Economy | - | - | 09/2022 | Paper | - |
| LangChain | Psychology | - | - | 10/2022 | - | Code |
| Blind Judgement | Jurisprudence | - | - | 01/2023 | Paper | - |
| Horton | Political Science and Economy | - | - | 01/2023 | Paper | - |
| DECKARD | - | - | Robotics & Embodied AI | 01/2023 | Paper | Code |
| Planner-Actor-Reporter | - | - | Robotics & Embodied AI | 02/2023 | Paper | - |
| Auto-GPT | - | - | General Autonomous Agents | 03/2023 | - | Code |
| BMTools | - | - | General Autonomous Agents | 03/2023 | Paper | Code |
| BabyAGI | - | - | General Autonomous Agents | 04/2023 | - | Code |
| REFINER | Psychology | - | - | 04/2023 | Paper | Code |
| AgentGPT | - | - | General Autonomous Agents | 04/2023 | Paper | Code |
| Generative Agents | Social Simulation | - | - | 04/2023 | Paper | Code |
| AI-legion | - | - | General Autonomous Agents | 04/2023 | - | Code |
| LoopGPT | - | - | General Autonomous Agents | 04/2023 | Paper | Code |
| SCG | - | - | CS&SE | 04/2023 | Paper | - |
| AGiXT | - | - | General Autonomous Agents | 04/2023 | - | Code |
| IGLU | - | - | Civil Engineering | 04/2023 | Paper | - |
| IELLM | - | - | Aerospace Engineering | 04/2023 | Paper | - |
| GPT4IA | - | - | Industrial Automation | 04/2023 | Paper | Code |
| Workgpt | - | - | General Autonomous Agents | 05/2023 | - | Code |
| AgentVerse | Social Simulation | - | - | 05/2023 | - | Code |
| GPT Researcher | - | - | General Autonomous Agents | 05/2023 | Paper | Code |
| SmolModels | - | - | CS&SE | 05/2023 | - | Code |
| SuperAGI | - | - | General Autonomous Agents | 05/2023 | - | Code |
| VOYAGER | - | - | Robotics & Embodied AI | 05/2023 | Paper | Code |
| LLM4RL | - | - | Robotics & Embodied AI | 06/2023 | Paper | - |
| GPT Engineer | - | - | CS&SE | 06/2023 | - | Code |
| MiniAGI | - | - | General Autonomous Agents | 06/2023 | - | Code |
| DemoGPT | - | - | General Autonomous Agents; CS&SE | 06/2023 | - | Code |
| TaPA | - | - | Robotics & Embodied AI | 07/2023 | Paper | - |
| Math Agents | - | - | Science Education | 07/2023 | Paper | - |
| SocialAI School | Social Simulation | - | - | 07/2023 | Paper | - |
| Unified Agent | - | - | Robotics & Embodied AI | 07/2023 | Paper | - |
| S3 | Social Simulation | - | - | 07/2023 | Paper | - |
| Dialogue Shaping | - | - | Robotics & Embodied AI | 07/2023 | Paper | - |
| ToolLLM | - | - | CS&SE | 07/2023 | Paper | Code |
| ChatMOF | - | Document and Data Management | - | 08/2023 | Paper | - |
| MetaGPT | - | - | CS&SE | 08/2023 | Paper | Code |
| AutoGen | - | - | General Autonomous Agents | 08/2023 | Paper | Code |

This table summarizes only part of the previous work. See the More Comprehensive Summarization section for more papers.

Evaluation on LLM-based Autonomous Agent

| Model | Subjective | Objective | Benchmark | Time | Paper | Code |
| --- | --- | --- | --- | --- | --- | --- |
| WebShop | - | Environment Simulation; Isolated Reasoning; Multi-task Evaluation | ✓ | 07/2022 | Paper | Code |
| Social Simulacra | Human Annotation | Social Evaluation | - | 08/2022 | Paper | - |
| TE | - | Social Evaluation | - | 08/2022 | Paper | Code |
| LIBRO | - | Software Testing | - | 09/2022 | Paper | - |
| ReAct | - | Environment Simulation | ✓ | 10/2022 | Paper | Code |
| Out of One, Many | Turing Test | Isolated Reasoning; Multi-task Evaluation | - | 02/2023 | Paper | - |
| DEPS | - | Environment Simulation | ✓ | 02/2023 | Paper | - |
| Jalil et al. | - | Software Testing | - | 02/2023 | Paper | Code |
| Reflexion | - | Isolated Reasoning | - | 03/2023 | Paper | Code |
| IGLU | - | Environment Simulation | ✓ | 04/2023 | Paper | - |
| LLM+P | - | Isolated Reasoning | - | 04/2023 | Paper | - |
| Generative Agents | Human Annotation; Turing Test | - | - | 04/2023 | Paper | Code |
| ToolLLM | Human Annotation | Multi-task Evaluation | ✓ | 04/2023 | Paper | Code |
| GITM | - | Environment Simulation | ✓ | 05/2023 | Paper | Code |
| Two-Failures | - | Environment Simulation | - | 05/2023 | Paper | - |
| Voyager | - | Environment Simulation | ✓ | 05/2023 | Paper | Code |
| SocKET | - | Isolated Reasoning; Social Evaluation; Multi-task Evaluation | ✓ | 05/2023 | Paper | - |
| Mobile-Env | - | Environment Simulation; Isolated Reasoning; Multi-task Evaluation | ✓ | 05/2023 | Paper | Code |
| clembench | - | Environment Simulation | ✓ | 05/2023 | Paper | Code |
| Dialop | - | Social Evaluation | ✓ | 06/2023 | Paper | Code |
| ChatDB | - | Isolated Reasoning | - | 06/2023 | Paper | - |
| Feldt et al. | - | Software Testing | - | 06/2023 | Paper | - |
| CO-LLM | Human Annotation | Environment Simulation | - | 07/2023 | Paper | Code |
| Tachikuma | Human Annotation | Environment Simulation | ✓ | 07/2023 | Paper | - |
| ChatDev | - | Isolated Reasoning | - | 07/2023 | Paper | - |
| WebArena | - | Environment Simulation | ✓ | 07/2023 | Paper | Code |
| AgentSims | - | Social Evaluation | - | 08/2023 | Paper | Code |
| AgentBench | - | Multi-task Evaluation | ✓ | 08/2023 | Paper | Code |
| BOLAA | - | Environment Simulation; Multi-task Evaluation; Software Testing | ✓ | 08/2023 | Paper | Code |
| Gentopia | - | Isolated Reasoning; Multi-task Evaluation | ✓ | 08/2023 | Paper | Code |

This table summarizes only part of the previous work. See the More Comprehensive Summarization section for more papers.


More Comprehensive Summarization

We maintain an interactive table that covers a more comprehensive set of papers on LLM-based agents. The table includes details such as tags, authors, and publication dates, and lets you sort, filter, and find the papers that interest you: Complete Table

Citation

If you find this survey useful, please cite our paper:

@misc{wang2023survey,
      title={A Survey on Large Language Model based Autonomous Agents}, 
      author={Lei Wang and Chen Ma and Xueyang Feng and Zeyu Zhang and Hao Yang and Jingsen Zhang and Zhiyuan Chen and Jiakai Tang and Xu Chen and Yankai Lin and Wayne Xin Zhao and Zhewei Wei and Ji-Rong Wen},
      year={2023},
      eprint={2308.11432},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

How to Contribute

We warmly welcome contributions to this survey and interactive table. If you have a paper or know of relevant research that should be included, please contribute through pull requests, issues, email, or other methods.

Contact Us

If you have any questions or suggestions, please contact us via:
