vivekjoshy / openskill.py

Multiplayer Rating System. No Friction.

Home Page: https://openskill.me


let score difference be reflected in rating

jonathan-scholz opened this issue

When you enter scores into rate(), the difference between the scores has no effect on the rating update, meaning rate([team1, team2], score=[1, 0]) == rate([team1, team2], score=[100, 0]) is true.
Both calls have exactly the same rating effect on team1 and team2.
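
To reproduce this, here is a minimal sketch using the newer model-based openskill.py API (older releases exposed a module-level rate() instead); treat the exact names as assumptions if you are on a different version:

```python
# Minimal sketch: show that the score margin does not change the update.
# Assumes the model-based openskill.py API (PlackettLuce et al.).
from openskill.models import PlackettLuce

model = PlackettLuce()

# Fresh default ratings for each call so both start from the same prior.
narrow = model.rate(
    [[model.rating(name="a")], [model.rating(name="b")]], scores=[1, 0]
)
blowout = model.rate(
    [[model.rating(name="a")], [model.rating(name="b")]], scores=[100, 0]
)

# The margin is ignored: both matchups produce identical posteriors.
print(narrow[0][0].mu == blowout[0][0].mu)        # True
print(narrow[1][0].sigma == blowout[1][0].sigma)  # True
```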

I don't know whether it is mathematically possible or what it would look like, but it would be great if the difference could somehow be factored into the calculation, since (if your game has a score) it is quite an important data point for skill evaluation.

I'm not aware of any papers that successfully implement score margins for any of the models in this library. There is a 2020 paper by Kovalchik that does this somewhat successfully for Elo, using four different margin-of-victory (MOV) models, but I'm not aware of how to adapt it to the Weng-Lin models. A rough sketch of the general margin-scaling idea follows the sources below.

Sources:

  1. Kovalchik, S. (2020). Extension of the Elo rating system to margin of victory. International Journal of Forecasting, 36(4), 1329–1341. https://doi.org/10.1016/j.ijforecast.2020.01.006
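
For context, the common thread in MOV Elo variants is to scale the standard Elo step by a function of the winning margin. The sketch below illustrates that general mechanism only; it is not a faithful reimplementation of any of Kovalchik's four models, and the log-dampened margin multiplier is an assumption on my part:

```python
# Illustrative margin-of-victory Elo step: the usual K * (S - E) update
# is multiplied by a dampened function of the score difference.
import math

K = 32  # standard Elo K-factor


def expected(r_a: float, r_b: float) -> float:
    """Win probability for A under the logistic Elo curve."""
    return 1.0 / (1.0 + 10 ** ((r_b - r_a) / 400))


def mov_update(r_a, r_b, score_a, score_b):
    """One rating step where the margin scales the adjustment."""
    s_a = 1.0 if score_a > score_b else 0.0 if score_a < score_b else 0.5
    g = math.log1p(abs(score_a - score_b))  # dampened margin multiplier
    delta = K * g * (s_a - expected(r_a, r_b))
    return r_a + delta, r_b - delta


print(mov_update(1500, 1500, 1, 0))    # small step for a narrow win
print(mov_update(1500, 1500, 100, 0))  # larger step for a blowout
```

Note that with this particular choice a draw produces no update at all, which already hints at why margin handling needs per-game design decisions.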

I think this is out of scope, but don't let me stop you. You'll want to create a new model and benchmark it against the other models, which are naive to score margins.

Thank you @daegontaven for the pointer to this resource. I was looking for a paper like this but couldn't find one. Let's see, maybe I can come up with something.

After looking into some papers, it seems this is not easily implemented. It depends heavily on the specific game or sport, and the margin model needs to be fitted before it can be applied. I will write something for my use case, but not a general solution for the package.
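
To make "needs to be fitted" concrete, here is one hedged sketch of what fitting could look like: replay historical matches, scale each Elo step by margin ** alpha, and pick the alpha whose pre-match predictions minimize log loss. The data shape, the power-law margin function, and the alpha grid are all illustrative assumptions, not anything from the package:

```python
# Hedged sketch: tune a single margin exponent on historical match data.
import math
from collections import defaultdict


def log_loss_for_alpha(matches, alpha, k=32):
    """matches: iterable of (player_a, player_b, score_a, score_b)."""
    ratings = defaultdict(lambda: 1500.0)
    loss = 0.0
    for a, b, s_a, s_b in matches:
        p = 1.0 / (1.0 + 10 ** ((ratings[b] - ratings[a]) / 400))
        won = 1.0 if s_a > s_b else 0.5 if s_a == s_b else 0.0
        # Score the pre-match prediction before updating the ratings.
        loss -= won * math.log(p) + (1.0 - won) * math.log(1.0 - p)
        margin = max(abs(s_a - s_b), 1)  # avoid zeroing out draws
        delta = k * (margin ** alpha) * (won - p)
        ratings[a] += delta
        ratings[b] -= delta
    return loss


def fit_alpha(matches, grid=(0.0, 0.1, 0.25, 0.5, 1.0)):
    """Grid-search the margin exponent with the best predictive fit."""
    matches = list(matches)
    return min(grid, key=lambda a: log_loss_for_alpha(matches, a))
```

Note that alpha = 0 recovers plain margin-blind Elo, so the fit can also tell you whether margins help at all for your particular game.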