akjayant / Coding_Reinforcement_Learning

Implementation of basic RL steps and algorithms: Dynamic Programming, Monte Carlo, DQN on Atari, Policy Gradient (REINFORCE with baseline), and Actor-Critic (A2C)

Coding the RL elements

Implementations of basic RL steps and algorithms, with my personal snippets and notes, in Jupyter notebooks.
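To give a flavor of the Dynamic Programming approach mentioned above, here is a minimal sketch of iterative policy evaluation on the 4x4 gridworld from Sutton & Barto (Example 4.1), under an equiprobable random policy. It is not taken from the notebooks: the names `step`, `evaluate_random_policy`, `gamma`, and `theta` are illustrative assumptions, and it does not use the `gridworld.py` environment referenced below.

```python
# Hypothetical sketch (not the notebooks' code): iterative policy evaluation
# on a 4x4 gridworld with terminal states in two opposite corners.
N = 4
TERMINALS = {(0, 0), (N - 1, N - 1)}
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Deterministic transition: a move off the grid leaves the state unchanged."""
    r, c = state
    nr, nc = r + action[0], c + action[1]
    if 0 <= nr < N and 0 <= nc < N:
        return (nr, nc)
    return state


def evaluate_random_policy(gamma=1.0, theta=1e-6):
    """Repeat Bellman expectation backups until the largest change is below theta."""
    values = {(r, c): 0.0 for r in range(N) for c in range(N)}
    while True:
        delta = 0.0
        for state in values:
            if state in TERMINALS:
                continue  # terminal states keep value 0
            # Each action is taken with probability 1/4; every step costs -1.
            new_value = sum(
                0.25 * (-1.0 + gamma * values[step(state, a)]) for a in ACTIONS
            )
            delta = max(delta, abs(new_value - values[state]))
            values[state] = new_value  # in-place update within the sweep
        if delta < theta:
            return values


values = evaluate_random_policy()
for r in range(N):
    print(" ".join(f"{values[(r, c)]:7.2f}" for c in range(N)))
```

With these settings the estimates should approach the values reported in the book for this example: about -14 for the cells next to a terminal corner and about -22 for the two non-terminal corners.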

References:

  1. Reinforcement Learning: An Introduction - Sutton & Barto
  2. Denny Britz RL repo - blackjack.py, gridworld.py, plotting.py

License: MIT License


Languages

Jupyter Notebook 95.9%, Python 4.1%