What are differences between Chinese and Chinese Subset leaderboards
zhimin-z opened this issue · comments
JIMMY ZHAO commented
JIMMY ZHAO commented
Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety.
zhimin-z opened this issue · comments