cameron-martin / tsumego

An app for playing tsumego puzzles

Home Page: https://tsumego.app/


Change puzzle win probability

cameron-martin opened this issue · comments

#15 picks puzzles that the user has a 50% chance of solving. We need a way of setting this probability to something different.

The following properties must hold:

  • The probability of the user solving puzzles given to them should be a fixed value. This arises from the notion that "puzzles should be of the correct difficulty" by just defining difficulty as the probability of the user solving a puzzle.
  • Every puzzle should have a non-zero probability of being shown to a user. This ensures that all puzzles will eventually get some games, and therefore have an opportunity to be rated.
  • Users should not be shown the same puzzle "too often". This is a bit of a vague requirement at the moment.
  • The probability of a puzzle being chosen only depends on the rating and rating deviation.

Let $s_i$ be the probability that puzzle $i$ is shown to the user. This is the distribution that we must sample from when showing the user a puzzle, but currently I do not know how to compute this distribution.

Let $w$ be the probability that the user wins. We want this to be a fixed value.

$w_i$ is the probability that the user wins given that the puzzle shown is puzzle $i$. This is a known value and can be computed as shown in the Glicko paper.
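For reference, the Glicko paper's expected-score formula gives this per-puzzle win probability from the user's rating and the puzzle's rating and rating deviation. A minimal sketch (function and variable names are mine):

```python
import math

Q = math.log(10) / 400  # Glicko scaling constant q

def g(rd: float) -> float:
    """Glicko g-function: discounts the rating difference by the
    opponent's (here, the puzzle's) rating deviation."""
    return 1 / math.sqrt(1 + 3 * (Q ** 2) * (rd ** 2) / math.pi ** 2)

def win_probability(user_rating: float, puzzle_rating: float, puzzle_rd: float) -> float:
    """Expected score from the Glicko paper: the probability that the
    user solves the puzzle."""
    return 1 / (1 + 10 ** (-g(puzzle_rd) * (user_rating - puzzle_rating) / 400))
```

For equal ratings this gives 0.5 regardless of the puzzle's RD, and a higher user rating pushes it above 0.5.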

The relationship between these values is, by the law of total probability:

$$w = \sum_i s_i w_i$$
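Assuming the relationship is the law of total probability over the selection distribution, it is a one-liner to evaluate:

```python
def overall_win_probability(s, w):
    """Total probability the user solves the next puzzle, given the
    selection distribution s (sums to 1) and the per-puzzle solve
    probabilities w: the dot product of the two."""
    assert abs(sum(s) - 1) < 1e-9, "s must be a probability distribution"
    return sum(si * wi for si, wi in zip(s, w))
```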

It should be made explicit that the probability that a puzzle is shown to the user depends on the rating and rating deviation of both the puzzle and the user, i.e. that the function is $s_i(r, \mathit{RD}, r_i, \mathit{RD}_i)$.

I think the best way forward is to assert a form for $s_i$ and optimise its parameters such that $w$ is relatively constant (within an interval?) over representative values of $r$ and $\mathit{RD}$ for all users. We may be able to source this from chess data - it seems reasonable to assume that the distribution of user/puzzle ratings and rating deviations is similar.
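As an illustration of this fitting step (the parametric form below is my own example, not something settled in this issue): take $s_i$ proportional to a Gaussian in $w_i$ around the target, with width $\sigma$ as the single parameter, and grid-search $\sigma$ so that the overall solve probability stays near the target over a stand-in pool of puzzles:

```python
import math
import random

def selection_weights(w_is, target, sigma):
    """One candidate form for s_i: a Gaussian in (w_i - target), normalised.
    sigma is the single free parameter to optimise."""
    raw = [math.exp(-((wi - target) ** 2) / (2 * sigma ** 2)) for wi in w_is]
    total = sum(raw)
    return [r / total for r in raw]

def overall_w(w_is, target, sigma):
    """w = sum_i s_i * w_i under the candidate form."""
    s = selection_weights(w_is, target, sigma)
    return sum(si * wi for si, wi in zip(s, w_is))

# Grid-search sigma so the overall solve probability stays near the target
# across a stand-in pool of per-puzzle solve probabilities. Real values
# would come from the user/puzzle rating distributions discussed above.
random.seed(0)
pool = [random.random() for _ in range(200)]
target = 0.6
best_err, best_sigma = min(
    (abs(overall_w(pool, target, s) - target), s)
    for s in [0.02 * k for k in range(1, 50)]
)
```

Note that this form also satisfies the second property above: every puzzle keeps a non-zero selection probability, although puzzles far from the target become extremely unlikely for small $\sigma$.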

The obvious choices of form are polynomials of increasing degree or, in the extreme, an artificial neural network.

Edit: I hadn't appreciated that $s_i$ depends on all puzzles ...

Alternatively, given a value of $w_i$ for a user and all puzzles (which we could recompute each rating period?), we could select puzzles such that the distribution of $w_i$ is approximately a chosen bounded-domain distribution on $[0, 1]$.
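One simple way to realise this (a sketch under my own assumptions, in particular the binning scheme and the `target_cdf` parameter): bin the puzzles by $w_i$, draw a bin with the target distribution's probability mass, then pick uniformly within the bin.

```python
import random

def sample_puzzle(w_is, target_cdf, n_bins=10, rng=random):
    """Sample a puzzle index so that the solve probability w_i of the
    chosen puzzle is approximately distributed as a chosen bounded
    distribution on [0, 1].

    target_cdf: CDF of the target distribution (e.g. a Beta CDF).
    Bins [0, 1] into n_bins, draws a bin with the target's mass in that
    bin (empty bins get zero weight), then picks a puzzle uniformly
    from the drawn bin."""
    bins = [[] for _ in range(n_bins)]
    for i, wi in enumerate(w_is):
        bins[min(int(wi * n_bins), n_bins - 1)].append(i)
    mass = [
        target_cdf((b + 1) / n_bins) - target_cdf(b / n_bins) if bins[b] else 0.0
        for b in range(n_bins)
    ]
    b = rng.choices(range(n_bins), weights=mass)[0]
    return rng.choice(bins[b])
```

With `target_cdf = lambda x: x` this reduces to a roughly uniform spread of difficulties; a Beta CDF peaked at the desired win probability would concentrate selection there while still giving every puzzle a non-zero chance.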