The agent learns the function target = sin(x − 3) cos(y) + N(0, 0.1), where N is normally distributed noise with mean 0 and standard deviation 0.1.
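As a rough illustration of the target described above, here is a minimal sketch of how one noisy sample could be generated. The function name `noisy_target` and its parameters are hypothetical and are not taken from SuperLearn.py. The sketch assumes the noise is Gaussian and uses only the standard library.

```python
import math
import random


def noisy_target(x, y, noise_std=0.1, rng=random):
    """Return sin(x - 3) * cos(y) plus zero-mean Gaussian noise.

    noisy_target, noise_std and rng are illustrative names (assumptions),
    not part of the assignment code.
    """
    noiseless = math.sin(x - 3.0) * math.cos(y)
    # gauss(mu, sigma): sigma is the standard deviation, not the variance.
    return noiseless + rng.gauss(0.0, noise_std)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so the run is repeatable
    samples = [noisy_target(1.0, 2.0, rng=rng) for _ in range(10000)]
    mean = sum(samples) / len(samples)
    # The sample mean should be close to the noiseless value.
    print("noiseless: %.4f" % (math.sin(1.0 - 3.0) * math.cos(2.0)))
    print("mean of noisy samples: %.4f" % mean)
```

Because the noise has zero mean, averaging many samples at the same (x, y) approaches the noiseless value sin(x − 3) cos(y), which is what a learner trained on these targets should converge toward.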
Requires Python 2.7.
conda create --name python27 python=2.7
source activate python27
conda install matplotlib
python SuperLearn.py
This should output a file named f10000, which can then be plotted:
python2 plot.py f10000
This is an assignment from a class on Reinforcement Learning taught by Richard Sutton, who is often called the father of reinforcement learning.