Michelle's starred repositories
PutnamBench
An evaluation benchmark for undergraduate competition math in Lean4, Isabelle, Coq, and natural language.