Generative Adversarial Imitation Learning

Implementation of Generative Adversarial Imitation Learning(GAIL) using tensorflow

Dependencies

python>=3.5
tensorflow>=1.4
gym>=0.9.3

Env==CartPole-v0
State==Continuous
Action==Discrete

Train experts

python3 run_ppo.py

Sample trajectory using expert

python3 sample_trajectory.py

Run GAIL

python3 run_gail.py

Run supervised learning

python3 run_behavior_clone.py

Test trained policy

python3 test_policy.py

Default policy is trained with gail
--alg=bc or ppo allows you to change test policy

If you want to test bc policy, specify the number of model.ckpt-number in the directory trained_models/bc
Example

python3 test_policy.py --alg=bc --model=1000

Tensorboard

tensorboard --logdir=log


Fig.1 Training results	legend

MIT LICENSE

Tensorflow implementation of Generative Adversarial Imitation Learning(GAIL) with discrete action

MIT License

Language:Python 100.0%