max-bellman-toy

Code for the gold mining environment described in the paper "Maximum Reward Formulation In Reinforcement Learning".
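
For readers unfamiliar with the paper, the following is a minimal sketch of how a max-reward Bellman target differs from the standard one. It is an illustration only, not the code in this repository: the function names `max_bellman_target` and `standard_bellman_target`, the `done` handling, and the default `gamma` are all assumptions made for this example, and the gold mining environment itself is not modeled here.

```python
from typing import Sequence


def standard_bellman_target(reward: float, next_q_values: Sequence[float],
                            gamma: float = 0.99, done: bool = False) -> float:
    """Usual Q-learning target: r + gamma * max_a' Q(s', a')."""
    if done or len(next_q_values) == 0:
        return reward
    return reward + gamma * max(next_q_values)


def max_bellman_target(reward: float, next_q_values: Sequence[float],
                       gamma: float = 0.99, done: bool = False) -> float:
    """Max-reward target: max(r, gamma * max_a' Q(s', a')).

    Instead of summing rewards along a trajectory, the target keeps the
    larger of the immediate reward and the discounted best future value.
    Returning the bare reward on terminal steps is an assumption of this sketch.
    """
    if done or len(next_q_values) == 0:
        return reward
    return max(reward, gamma * max(next_q_values))


if __name__ == "__main__":
    q_next = [0.5, 4.0]  # hypothetical Q-values for the next state
    print(standard_bellman_target(1.0, q_next, gamma=0.5))
    print(max_bellman_target(1.0, q_next, gamma=0.5))
```

With the max-reward target, a small immediate reward does not add to the estimate when a larger discounted reward is reachable later; only the larger of the two is kept.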