An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.
Geek Repo:Geek Repo
Github PK Tool:Github PK Tool