what is pre-training dqn model and world model ? initialize Q(s; a; θQ) and M(s; a; θM) via pre-training on human conversational data？

Question

what is pre-training dqn model and world model ? initialize Q(s; a; θQ) and M(s; a; θM) via pre-training on human conversational data？

netrookiecn opened this issue 5 years ago · comments

Hi
I dont understand the pretraining of the world model because I can not find the pretraining process in your code, can you explain me what is that?
and where is the pretraining dqn model and world model in your repo?
thanks

Dr-Corgi · Answer 1 · Wed Apr 08 2020 11:21:18 GMT+0800 (China Standard Time)

I have the same question. lol