ZubinGou / math-evaluation-harness

A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ZubinGou/math-evaluation-harness Stargazers