babelcloud / LLM-RGB

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

Home Page:http://gru.ai

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

babelcloud/LLM-RGB Issues