Report Different models fight on the street

Question

taozhiyuai opened this issue 4 months ago

taozhiyuai commented 4 months ago

Two models fought 50 rounds; the report is below.

Nicolas Oulianov · Answer 1 · Mon Apr 01 2024 23:57:49 GMT+0800 (China Standard Time)

Impressive results! Finally a benchmark where Gemma wins lol

You should have a file called "results.csv", right? Is it the one you used to compute the win rates?

taozhiyuai · Answer 2 · Tue Apr 02 2024 00:37:18 GMT+0800 (China Standard Time)

Yes, the data in the table all come from results.csv.
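
For anyone who wants to reproduce win rates from a file like results.csv, here is a minimal sketch. The actual column layout of results.csv is not shown in this thread, so the column names below (`player_1`, `player_2`, `winner`) and the sample rows are hypothetical; adjust them to match the real file.

```python
import csv
import io
from collections import Counter


def win_rates(csv_text, winner_column="winner"):
    """Return {model: share of rounds won} from CSV text with one row per round.

    The column name "winner" is an assumption, not the confirmed
    layout of results.csv.
    """
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    if not rows:
        return {}
    # Count how many rounds each model won, then divide by total rounds.
    wins = Counter(row[winner_column] for row in rows)
    return {model: count / len(rows) for model, count in wins.items()}


# Made-up sample data for illustration only, not the results from this thread.
sample = (
    "player_1,player_2,winner\n"
    "model_a,model_b,model_a\n"
    "model_a,model_b,model_b\n"
    "model_a,model_b,model_a\n"
    "model_a,model_b,model_a\n"
)
rates = win_rates(sample)
```

To run it on a real file, read the file contents first (for example with `open("results.csv", newline="").read()`) and pass the text to `win_rates`.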

taozhiyuai · Answer 3 · Tue Apr 02 2024 00:48:16 GMT+0800 (China Standard Time)

I tried to choose models with the same parameter count, or the same file size at the same quantization (Q) level, so that token generation speed stays similar. Big models always lose because their token generation is too slow.

I think Gemma 7b is good enough; it is time to train the model.

Nicolas Oulianov · Answer 4 · Tue Apr 02 2024 08:32:41 GMT+0800 (China Standard Time)

You want to do fine-tuning?

taozhiyuai · Answer 5 · Tue Apr 02 2024 09:43:11 GMT+0800 (China Standard Time)

Yes, it is interesting.