yupeijei1997 / MMTB

Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions

Home Page:https://arxiv.org/abs/2504.02623

Repository from Github https://github.comyupeijei1997/MMTBRepository from Github https://github.comyupeijei1997/MMTB

yupeijei1997/MMTB Stargazers