aravindr93 / mjrl

Reinforcement learning algorithms for MuJoCo tasks

aravindr93/mjrl Issues

gail
Updated a year ago
Experimental results of MoREL for D4RL benchmarks
Updated a year ago8
Understanding obs_mask
Updated a year ago1
Hyperparams for D4RL Mujoco tasks
Updated a year ago
For Morel, you truncate the uncertain rollouts instead of setting the negative reward?
Updated 2 years ago
Not learning reward?
Updated 3 years ago3
NPG kl_mean is always 0
Updated 3 years ago
why are the advantages multiplied by 1e-2 in dapg.py?
Closed 3 years ago1
difference between `a is b` and `a == b`
Closed 3 years ago
missing packages in conda env build
Closed 3 years ago1
Unable to complete the installation from the provided yml file
Closed 3 years ago1
Pickling of _VariableFunctions not compatible with PyTorch 1.5.0
Closed 3 years ago1
RuntimeError: CUDA out of memory.
Closed 3 years ago
Actions not clipped when generating synthetic trajectories
Updated 3 years ago
Unnecessary imports. Can be cleaned
Updated 4 years ago
Sampler timeouts when running many concurrent trainings
Closed 4 years ago
No RBF code?
Updated 4 years ago
Much worse learning performance with new code base
Closed 4 years ago4
[Question] Meaning of the "al" variable?
Closed 5 years ago4
Value function approximator
Closed 6 years ago1
linear policy?
Closed 6 years ago5
Is mean KL always zero?
Closed 6 years ago1
Small typos in the readme
Closed 6 years ago