AntonioAlgaida / Playground

In this repository I will try different algorithms and play with them.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Playground

In this repository I will try different algorithms and play with them.

Playground 0

I have been playing with Stable_Baselines3 and the Lunar_Lander_v2 environment.

Obtained an average reward of 270, training for 2e6 timesteps with the PPO algorithm.

See how well it works

About

In this repository I will try different algorithms and play with them.

License:MIT License


Languages

Language:Jupyter Notebook 100.0%Language:Python 0.0%