night-chen / ToolQA

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.

Home Page:https://arxiv.org/pdf/2306.13304.pdf

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

night-chen/ToolQA Stargazers