Data and code for paper "M3Exam: A Multilingual, Multimodal, Multilevel Benchmark for Examining Large Language Models"
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool