Upcoming features:
- The optimizer for the baseline will train multiple times on the current trajectory and will feature weight decay.
- Generalized Advantage Estimation.
- Extensive built-in logging.
- Ability to start at a specified state.
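
As an illustration of the Generalized Advantage Estimation item above, here is a minimal sketch of the standard GAE recursion in plain Python. It is not this framework's implementation: the function name `gae_advantages`, its parameters, and the default values of `gamma` and `lam` are assumptions chosen for the example. It assumes `values` holds one more entry than `rewards`, with the last entry being the bootstrap value of the final state.

```python
def gae_advantages(rewards, values, gamma=0.99, lam=0.95):
    """Compute GAE advantages for a single trajectory.

    Hypothetical helper, not part of the framework's API.
    rewards: list of T rewards.
    values:  list of T + 1 value estimates (last one is the bootstrap value).
    """
    if len(values) != len(rewards) + 1:
        raise ValueError("values must have exactly one more entry than rewards")

    advantages = [0.0] * len(rewards)
    running = 0.0
    # Walk the trajectory backwards, accumulating discounted TD residuals.
    for t in reversed(range(len(rewards))):
        # One-step TD residual: r_t + gamma * V(s_{t+1}) - V(s_t)
        delta = rewards[t] + gamma * values[t + 1] - values[t]
        # A_t = delta_t + gamma * lam * A_{t+1}
        running = delta + gamma * lam * running
        advantages[t] = running
    return advantages
```

Setting `lam=0.0` reduces each advantage to the one-step TD residual, while `lam=1.0` recovers the discounted return minus the value baseline.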
A framework for reinforcement learning, optimal control and trajectory optimization.
The Unlicense