There are 0 repository under safety-evaluation topic.
S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models
SORRY-Bench: Systematically Evaluating Large Language Model Safety Refusal Behaviors