THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

Home Page:https://llmbench.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

THUDM/AgentBench Stargazers