hayashimasa / Robust_MuZero

A robust variant of MuZero

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Robust MuZero

A robust variant of MuZero

Authors: Masahiro Hayashi, Bakr Ouairem

Supervisors: Edouard Leurent, Odalric-Ambrym Maillard

Overview

In order to propose a robust variant of MuZero, we first have to make the state transitions stochastic; to do so, we incorporate the work from PlaNet; namely, we integrate the SSM and RSSM model into the dynamics function of MuZero. We also modify MCTS so that the agent plans conservatively and robustly. So far, we haven't built a model that converges to any meaningful result.

About

A robust variant of MuZero

License:MIT License


Languages

Language:Python 100.0%