mistake-in-retro-contest-of-OpenAI

My nickname is mistake in retro contest. This is my source code of contest, and this maybe a bit messy, I'm sorry for it. I will reorgnanize it later.

I only try model of rainbow during the contest, so there is no PPO or jerk model in my code. Maybe I will add them later.

In order to adjust some parameters, some classes and functions in the project are copied from the anyrl-py(https://github.com/unixpickle/anyrl-py)

#about transfer learning

During the contest period, We tried the following two approaches to train the model of rainbow:

Design the Network only, then submit the job that parameters are not pre-train,
Design the Network and per-train the parameters locally (about 7 millions timesteps), then submit the job.

We choose the first way to train the model. It's score is about 2000 more than the second one. The reason might be that the second approach loses the randomization of learning after per-training.

#some optimizations base the OpenAI-baselines

After testing the parameter named Nstep, we found the best value is 4, we think it's a balance point between the current and the future.
The model using nature_cnn training 1 million steps costs about 9 hours,while the max limit time is 12 hours, which leads to an oppotunity to design a complex network to replace the nature_cnn. After testing a lot of models, we found the network with one layer of cnn added on nature_cnn is the best.
The parameter of target_interval which the baseline uses is 8192. We found it takes too long between two replace operations due to the near convergence of the training process after 1000 steps, so We choose 1024 as the final value of target_interval.

#Prerequisites

Python 3.4+ pip virtualenv

#Run

copy this project into your local
python3 -m venv your-virtualenv-name
source your-virtualenv-name/bin/activate
cd mistake-in-retro-contest-of-OpenAI/
./requirements.sh (if failed, try chmod a+x requirements.sh). At the last step, require your password.
if you want to train the model of Sonic games, you should have their *.md files, after getting the *.md files, execute "python -m retro.import ". more details(https://github.com/openai/retro)
cd src/main/ and python contest_main.py the process will start to train the model. If you want to change the game or save the model, you can motify the model/rainbow.py and utils/sonic_util.py
If you want to try transfer a pre-training model to a new game, you can copy files of model to src/main/transfer_model/ and motify the path in transfer.py. Finally, execute "python transfer.py"

zhuliquan / mistake-in-retro-contest-of-OpenAI

mistake-in-retro-contest-of-OpenAI

About

Languages