Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions
Home Page: https://arxiv.org/abs/2504.02623
Repository from GitHub: https://github.com/yupeijei1997/MMTB