google-deepmind / open_spiel

Hi.
Is there any way to evaluate a model trained with the rnad algorithm against a random agent? (like tic_tac_toe_dqn_vs_tabular.py for example)
In tic_tac_toe_dqn_vs_tabular.py, the action is taken from the return value of the step function (type step_output) and env is transitioned to the next state.
However, the return value of the step function in rnad is of type dect, so I ',m not sure if I can do the same thing in this way.
Is there any better way to do this?

Ok so you can do it, but you need a few steps because the RNaD implementation does not expose the policy as an RNaD agent.

You can get the policy for any state using action_probabilities:

open_spiel/open_spiel/python/algorithms/rnad/rnad.py

Line 985 in 7c58b6c

def action_probabilities(self,

(you can get the state from the environment using get_state accessor in rl_environment). Then you can just sample an action from the policy and take a step on the environment using the action.

Hope this helps!

Thank you for your reply.
By following your comments, I was able to do what I wanted to do!
It helped, thank you very much.

No problemo, glad it helped!

Question: how to evaluate rnad algorithm