control inverted-pendulum reinforcement-learning underactuated trajectory-optimization

Underactuated Systems (under development)

This repository contains modified OpenAI gym environments for continuous control. The purpose of this project is to create environments that can be used to prototype/experiment with optimal control and reinforcement learning algorithms.

Enviroments

So far, working environments include the Pendulum, Inverted Pendulum (Cartpole), Dual Inverted Pendulum, Acrobot, and Pendubot. Environments currently under development include the Double Inverted Pendulum and Acrowheel (i.e. wheeled inverted pendulum + acrobot).

Dynamics

All of the equations of motion have been derived by hand. Using the standard manipulator form for robot dynamics, we keep track of the inertial mass matrix M, the Coriolis matrix C, gravity vector G, and a vector/array B that maps torque inputs to generalized coordinates. By using the manipulator equations, we can easily obtain the linearized dynamics equations.

Numerical Integration

We currently use a Semi-implicit Euler method for integrating the dynamics. This is more stable than the standard Euler method and the Verlet method, both of which diverge pretty quickly. We will probably implement a fourth order Runge-Kutta method in the future.

Contributions

Contributions are greatly appreciated.

About

Modified OpenAI gym environments for continuous control of underactuated systems

control inverted-pendulum reinforcement-learning underactuated trajectory-optimization

Languages

Language:Python 100.0%