MiuLab / DDQ

Deep Dyna-Q: Integrating Planning for Task-Completion Dialogue Policy Learning

What is the pre-training of the DQN model and the world model? Are Q(s, a; θQ) and M(s, a; θM) initialized via pre-training on human conversational data?

netrookiecn opened this issue

Hi,
I don't understand the pre-training of the world model because I cannot find the pre-training process in your code. Can you explain what it is?
Also, where is the pre-training for the DQN model and the world model in your repo?
Thanks

I have the same question. lol