aravindr93/mjrl Issues
gail (Updated)
Understanding obs_mask (Updated 1)
Not learning reward? (Updated 3)
NPG kl_mean is always 0 (Updated)
No RBF code? (Updated)
Value function approximator (Closed 1)
linear policy? (Closed 5)
Is mean KL always zero? (Closed 1)
Reinforcement learning algorithms for MuJoCo tasks