2prime / pytorch-gym

Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

DDPG in bullet Gym using pytorch

Overview

This is an implementation of Deep Deterministic Policy Gradient (DDPG) in bullet Gym using PyTorch.

Dependencies

  • Python 3.6.2
  • pytorch 0.2.0
  • gym
  • tensorboardX-1.0
  • pybullet (if you want to train agents for bullet env)

Run

  • here is a simple example to train CartPole with high efficiency:

$ cd base

$ python main.py --debug --discrete --env=CartPole-v0 --vis

  • you can use this to understand usage of each argument:

$ python main.py --help

  • some explanation of important arguments:

--debug: print the reward and some other information

--discrete: if the actions are discrete rather than continuous

--vis: render each action (but it would slow down your training speed)

--cuda: train this task using GPU

--test: testing mode

--resume : load model from the path

Contributors

About

Implementation of the Deep Deterministic Policy Gradient(DDPG) in bullet Gym using pytorch


Languages

Language:Python 66.5%Language:Jupyter Notebook 33.5%