DQN rlax + bsuite vs rlax + gymnax

Question

RobertTLange opened this issue 3 years ago

Robert Tjarko Lange commented 3 years ago

I would like a benchmark figure comparing the DQN example in rlax (on bsuite) with a gymnax-accelerated version. Ideally, it should compare the runtime of environment step transitions on different devices.

At the moment there is something wrong with the optimisation and/or evaluation. Figure out the bug 🐛.

The agents should all be in an experimental directory.
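
For the runtime comparison, a minimal timing sketch in JAX might look like the following. It is only an illustration under stated assumptions: `toy_step` is a hypothetical stand-in for an environment transition, not the gymnax or bsuite API, and `rollout` and `time_steps` are made-up helper names. The point it shows is the measurement pattern: jit the rollout, run it once to exclude compile time, and call `block_until_ready()` so asynchronous dispatch does not hide the real runtime.

```python
import time

import jax
import jax.numpy as jnp


def toy_step(key, state, action):
    """Hypothetical stand-in for an environment step (NOT the gymnax API)."""
    noise = jax.random.normal(key, state.shape)
    next_state = 0.9 * state + 0.1 * action + 0.01 * noise
    reward = -jnp.sum(next_state ** 2)
    return next_state, reward


def rollout(key, state, num_steps):
    """Run num_steps transitions inside a single compiled lax.scan loop."""
    def body(carry, step_key):
        # Fixed dummy action; a real agent would pick the action here.
        next_state, reward = toy_step(step_key, carry, jnp.ones_like(carry))
        return next_state, reward

    keys = jax.random.split(key, num_steps)
    _, rewards = jax.lax.scan(body, state, keys)
    return rewards


def time_steps(device, num_steps=1000, repeats=5):
    """Return mean wall-clock seconds per step transition on `device`."""
    # Committing the inputs to a device makes the jitted call run there.
    key = jax.device_put(jax.random.PRNGKey(0), device)
    state = jax.device_put(jnp.zeros(4), device)
    run = jax.jit(rollout, static_argnums=2)
    # Warm-up call so compilation is not included in the measurement.
    run(key, state, num_steps).block_until_ready()
    start = time.perf_counter()
    for _ in range(repeats):
        run(key, state, num_steps).block_until_ready()
    elapsed = time.perf_counter() - start
    return elapsed / (repeats * num_steps)


# jax.devices() lists only the devices of the default backend.
results = {str(d): time_steps(d) for d in jax.devices()}
for name, seconds in results.items():
    print(f"{name}: {seconds * 1e6:.2f} us per step")
```

To compare backends on one machine, the same function could also be called with `jax.devices("cpu")[0]`; swapping `toy_step` for a real environment step would be the next change.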

Robert Tjarko Lange · Answer 1 · Wed Apr 21 2021 15:41:16 GMT+0800 (China Standard Time)

Alternatively or additionally, we can simply drop gymnax into the Anakin Catch example.
Also add the CMA-ES example for Pendulum-v0 as a notebook!

Robert Tjarko Lange · Answer 2 · Mon May 03 2021 16:14:00 GMT+0800 (China Standard Time)

Addressed in d7e262b and 9b92dbe.