GDG-WorkShop
Reinforcement Learning for everyone
Google Developer Group
import gym
env = gym.make("CartPole-v1")
observation, info = env.reset(seed=42, return_info=True)
for _ in range(1000):
env.render()
# action = policy(observation) # User-defined policy function
observation, reward, done, info = env.step(1)
if done:
observation, info = env.reset(return_info=True)
env.close()
Keynote: https://bit.ly/3a3UHNI