zuzannapiekarczyk / ReinfocementLearningClasses

Reinforcement Learning Classes repository

This repository contains the tasks from my Reinforcement Learning course. All the tasks are implemented in Python, most of them in Jupyter Notebooks. Before attempting most of the tasks, it is recommended to attend the lecture or watch the corresponding YouTube video.

The course is based on the great book Reinforcement Learning: An Introduction by R. S. Sutton and A. G. Barto.

Other great resources are:

Solutions

Generally speaking, I do not publish solutions here. If you need my solutions, feel free to contact me via e-mail.

Current content (in learning order)

  • kArmedBandit - implementation of three basic agents (ε-greedy, Optimistic, and UCB) for estimating action values in the k-Armed Bandit problem
  • DynamicProgramming - Dynamic Programming for a Maze environment (the environment is also in this folder)
  • MonteCarlo - Monte Carlo for the OpenAI Black Jack environment
  • TD_0 - Monte Carlo and TD(0) for the Walking environment
  • QLearning - SARSA and Q-learning for both the Cliff Walking and Taxi environments
  • DQLearning - Deep Q-Learning. For now it contains two environments (upgrades coming soon):
    • Cart Pole - basic DQN task
    • Mountain Car - like the previous one, but with local and target neural networks
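
To make the ideas behind the kArmedBandit and QLearning items a little more concrete, here is a minimal sketch of ε-greedy action selection and of the difference between the SARSA and Q-learning update targets. It is not the code from this repository: all function names and parameters (`epsilon_greedy`, `sarsa_target`, `q_learning_target`, `td_update`, `alpha`, `gamma`) are hypothetical and chosen only for illustration, and Q-values are assumed to be stored as a plain list per state.

```python
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """Pick a random action with probability epsilon, otherwise a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    best = max(q_values)
    # Break ties randomly among all actions that share the best estimate.
    return rng.choice([a for a, q in enumerate(q_values) if q == best])


def sarsa_target(reward, next_q_values, next_action, gamma, done):
    """On-policy target: uses the action actually selected in the next state."""
    if done:
        return reward
    return reward + gamma * next_q_values[next_action]


def q_learning_target(reward, next_q_values, gamma, done):
    """Off-policy target: uses the greedy (maximum) value of the next state."""
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def td_update(q, target, alpha):
    """Move the current estimate q a fraction alpha towards the target."""
    return q + alpha * (target - q)


if __name__ == "__main__":
    next_q = [1.0, 3.0]  # illustrative Q-values of the next state
    # Assume the behaviour policy happened to explore and picked action 0.
    print(sarsa_target(-1.0, next_q, next_action=0, gamma=0.5, done=False))
    print(q_learning_target(-1.0, next_q, gamma=0.5, done=False))
```

The two targets differ only when the next action is not the greedy one, which is one common explanation for why SARSA and Q-learning can learn different paths in Cliff Walking under an exploring policy.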

Disclaimers

This repository is under construction; new content will appear continuously.

Although I'll do my best to mark every place where I use someone else's ideas or code, sometimes it may be hard. A lot of my code was written while learning from other courses, and I am not sure I remember where all of the code and ideas came from. If you feel that I have missed any references, please let me know.

This repository is only a part of the course, prepared for sharing with my students. Feel free to use it as you wish, but without the rest of the course it may not be that useful to you.

Languages

Jupyter Notebook 66.0%, Python 34.0%