HiThink Research's repositories
BizFinBench
A Business-Driven Real-World Financial Benchmark for Evaluating LLMs
MME-Finance
[MM 2025] A Multimodal Finance Benchmark for Expert-level Understanding and Reasoning
PuzzleClone
PuzzleClone: An SMT-Powered Framework for Synthesizing Verified Mathematical Reasoning Data
PolyhedronEvaluator
PolyhedronEvaluator