OpenGenerativeAI / llm-colosseum

Benchmark LLMs by fighting in Street Fighter 3! The new way to evaluate the quality of an LLM

Home Page: https://huggingface.co/spaces/junior-labs/llm-colosseum

Is there a way to set it to do best 3 of 5?

lafintiger opened this issue

Is there a way to set the matches to a best-of series, e.g. best 3 of 5, best 4 of 7, etc.?

Thanks,

You should ask about it on the Diambra Discord!
https://discord.gg/YduaNe5cN8

I checked with them, and the answer is basically no. So it would have to be part of the script here: loop over matches, check the results.csv file after each one, and exit once the required number of wins is reached. I will see what I can cook up.
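
For anyone who wants to try the same thing, here is a rough sketch of that loop in Python. It is not the project's actual code. It assumes results.csv has a header row with a `winner` column (the real column names may differ), and `run_match` is a hypothetical callable standing in for whatever plays one match and appends one row to the file.

```python
import csv
from collections import Counter
from pathlib import Path
from typing import Callable, Optional


def wins_needed(best_of: int) -> int:
    """Wins required to take a series, e.g. 3 for best of 5, 4 for best of 7."""
    return best_of // 2 + 1


def count_wins(results_path: str, winner_column: str = "winner") -> Counter:
    """Count wins per player in the results file.

    ASSUMPTION: the file has a header row and a column naming the winner.
    """
    path = Path(results_path)
    if not path.exists():
        return Counter()
    with path.open(newline="") as f:
        return Counter(
            row[winner_column]
            for row in csv.DictReader(f)
            if row.get(winner_column)
        )


def play_series(
    run_match: Callable[[], None],  # hypothetical: plays one match, appends one row
    results_path: str,
    best_of: int = 5,
    winner_column: str = "winner",
) -> Optional[str]:
    """Run matches until one player has enough wins; return that player."""
    target = wins_needed(best_of)
    # Ignore rows that were already in the file before this series started.
    baseline = count_wins(results_path, winner_column)
    for _ in range(best_of):
        run_match()
        series = count_wins(results_path, winner_column) - baseline
        for player, wins in series.items():
            if wins >= target:
                return player
    return None  # no one reached the target (e.g. draws or missing rows)
```

Calling `play_series(run_match, "results.csv", best_of=7)` would stop as soon as one player has 4 wins. Taking a baseline first means old rows in results.csv do not count toward the current series.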

That works as well 👍