ELO ranking score?

Question

Tokkiu opened this issue 4 months ago

How is this ranking generated? If I add a new model, how can I reproduce this benchmark?

高璟琦 · Answer 1 · Wed Apr 10 2024 11:35:49 GMT+0800 (China Standard Time)

My new model is implemented in this PR: https://github.com/OpenGenerativeAI/llm-colosseum/pull/45/files
You can watch a video of my model vs. Mistral here:
https://github.com/Tokkiu/llm-colosseum?tab=readme-ov-file#1-vs-1-mistral-vs-solar

shawokou123 · Answer 2 · Wed Apr 10 2024 13:15:55 GMT+0800 (China Standard Time)

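My new model is implemented in this PR. https://github.com/OpenGenerativeAI/llm-colosseum/pull/45/files You can watch a video of my model vs. Mistral here. https://github.com/Tokkiu/llm-colosseum?tab=readme-ov-file#1-vs-1-mistral-vs-solar

Hello 璟琦, I am also very interested in this project. Could we get in touch to discuss it?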

taozhiyuai · Answer 3 · Fri Apr 12 2024 20:24:01 GMT+0800 (China Standard Time)

I just run 50 rounds between two models, and the result shows which model is better. At the moment, Gemma 7B is the best; v1.1 is worse.
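
For anyone who wants to turn head-to-head results like these into a ranking, here is a minimal sketch of the standard Elo update. This is not necessarily how the project computes its leaderboard; the starting rating of 1500, the K-factor of 32, the function names, and the model names in the sample data are all assumptions for illustration.

```python
from collections import defaultdict


def expected_score(rating_a: float, rating_b: float) -> float:
    """Probability that A beats B under the standard Elo logistic model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_elo(ratings, winner: str, loser: str, k: float = 32.0) -> None:
    """Apply one Elo update in place for a single decided match."""
    # The winner gains what the loser gives up, so total rating is conserved.
    delta = k * (1.0 - expected_score(ratings[winner], ratings[loser]))
    ratings[winner] += delta
    ratings[loser] -= delta


def rank_models(matches, initial: float = 1500.0, k: float = 32.0):
    """Replay (winner, loser) pairs in order and return models sorted by rating."""
    ratings = defaultdict(lambda: initial)  # unseen models start at `initial`
    for winner, loser in matches:
        update_elo(ratings, winner, loser, k)
    return sorted(ratings.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical match log: each tuple is (winner, loser) for one round.
sample_matches = [
    ("model_a", "model_b"),
    ("model_b", "model_a"),
    ("model_a", "model_b"),
]
for name, rating in rank_models(sample_matches):
    print(f"{name}: {rating:.1f}")
```

To add a new model under this scheme, you would play it against the existing models, append the outcomes to the match log, and recompute. Sequential Elo updates depend on the order of matches, so replay the full log in a fixed order if you want reproducible numbers.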