hardmaru / WorldModelsExperiments

World Models Experiments

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

the training of CMA-ES shows high average reward but when you just check the model against the log the rewards are practically zero

itabhiyanta opened this issue · comments

Hi @hardmaru

Thanks for posting this repo. i have a strange issue I see a very promising curve for the training of my CMA-ES model however i cannot replicate the results when i execute the following command.

python3.5 model.py log/filewiththe best stats.json
dispatch

I am using a custom environment.

I also wish to ask you something about the number of processors for the training of the CMA-ES model. I used 16 processors and also 48 processors (couldn't use 64 processors as then i run out of memory). Do you think reducing the number of processors for training of the CMA-ES model will have some adverse effect?

Kindly advise.
Rohit

yep that was it. i didn't use it thinking that since i do not use the gym environment in general it doesn't apply to me.
thanks

cool. I'd be interested to see any results for custom environments, looking forward to see your publications in the future.