PingchuanMa / Respect-Learning

No Respect - No Learning

Respect-Learning

Setup

After following setup procedure on official website, run setup.sh for installing required packages.

Usage

Run run.py for local training and testing, run submit.py for online submission.

To-dos

Simulator

Understand observation, especially physical properties of joint
Figure out the reason why action repeat slows things down so much

Reward Shaping

Encourage bending knees
Activate muscle more for speed up
Encourage more aggressive stepping, currently the agent is too cautious to make a big step

RL Tricks

Discretize action value
Action noise & parameter noise
Layer normalization
Observation engineering

Benchmarks

Network structure
Activation function (SELU, RELU, ELU, ...)
1D conv / FC

About

No Respect - No Learning

Languages

Language:Python 99.7%Language:Shell 0.3%