drq_jax

Jax Implementation of Data-regularized Q (DrQ)

(It's my lecture project of Reinforcement Learning :)

python drq.py cfg=walker_walk train_seed=0

Running the code requires ≈38 GB GPU memory.

As I can access large memory GPUs, so I did not implement a memory-efficient replay buffer for image observations.

Leave it for future work (下次一定!)

About

Jax Implementation of Data-regularized Q (DrQ)

Language:Python 100.0%