IntrinsicallyEfficientStableOPE
Code for Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning
CB OPE (Tables 2--4)
Run
seq 4 | xargs -L 1 -P 4 ./script2.sh
in the appropriate folder.
RL OPE (Tables 5--7)
Run
- windy.py
- cliff.py
- mountain.py