Evaluation script released?

Question

Evaluation script released?

hxhcreate opened this issue 7 months ago · comments

I was following this work.
It would be greatly appreciated if you could release the evaluation code to help us reproduce your results!

Shengye Wan · Answer 1 · Tue Apr 02 2024 07:06:23 GMT+0800 (China Standard Time)

Hi there, could you please provide more information, such as whether your question is about Llama Guard or CyberSecEval, and what exact script you are looking for? Thanks.

Manish · Answer 2 · Tue Apr 16 2024 04:33:08 GMT+0800 (China Standard Time)

Which script are you looking for?

Ujjwal Karn · Answer 3 · Sat Apr 20 2024 05:36:04 GMT+0800 (China Standard Time)

Hi, if you're looking for Llama Guard evaluation, Llama recipes has a script for running inference. We then use sklearn's precision_score, recall_score, f1_score, average_precision_score to compute the metrics. Is this what you're looking for?

I will close this issue, but please reopen if you have further questions. Thanks!