phuongboi / lunar-lander

Solve lunar lander problem with reinforcement learning

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

This repo contain some of my re-implement of lunar lander with reinforcement learning algorithm

Lunar lander v1

  • Solve lunar lander problem from openai Gymnasium use Q-learning and experience replay memory [2].
  • The implementation base on fakemonk1 [1] and references from juliankappler [3]
  • Friendly and simple implementation with pytorch
  • Run python lunar_lander_v1.py
Training result

alt text

Lunar lander v2

  • Solve lunar lander problem from openai Gymnasium [2] use Q-learning.
  • Periodly update q_target network parameter [4]
  • Use softmax policy instead of epsilon greedy policy
  • Multiple training step from replay memory

Reference

About

Solve lunar lander problem with reinforcement learning


Languages

Language:Python 100.0%