the training of CMA-ES shows high average reward but when you just check the model against the log the rewards are practically zero
itabhiyanta opened this issue · comments
Hi @hardmaru
Thanks for posting this repo. i have a strange issue I see a very promising curve for the training of my CMA-ES model however i cannot replicate the results when i execute the following command.
python3.5 model.py log/filewiththe best stats.json
I am using a custom environment.
I also wish to ask you something about the number of processors for the training of the CMA-ES model. I used 16 processors and also 48 processors (couldn't use 64 processors as then i run out of memory). Do you think reducing the number of processors for training of the CMA-ES model will have some adverse effect?
Kindly advise.
Rohit
yep that was it. i didn't use it thinking that since i do not use the gym environment in general it doesn't apply to me.
thanks
cool. I'd be interested to see any results for custom environments, looking forward to see your publications in the future.