Implement skeleton based on stable-baselines

Question

dniku opened this issue 5 years ago · comments

Approximate requirements:

Script that trains an agent with one of the existing algorithms in stable-baselines
Runs locally, no need to adapt for cloud
Fixes random seed
Specifies network architecture (might be a copy of a predefined one), not imports it from the library
Saves model after training
Saves metrics after training (similar to ones reported in LTIAA paper)

Let's make attention visualization a part of another issue (inb4: gym has video recording).

Dmitry Nikulin · Answer 1 · Mon Mar 25 2019 02:05:56 GMT+0800 (China Standard Time)

Remains: