- trajectory/ storage trajectories of expert like (s,a)
python run.py
rewards:
dscriminator loss:
learning robust rewards with adversarial inverse reinforcement learning
learning robust rewards with adversarial inverse reinforcement learning
python run.py
rewards:
dscriminator loss:
learning robust rewards with adversarial inverse reinforcement learning
learning robust rewards with adversarial inverse reinforcement learning