Python replication of Sutton & Barto's book Reinforcement Learning: An Introduction (2nd Edition)
If anything in the code is confusing, or you want to report a bug, please open an issue instead of emailing me directly. Unfortunately, I do not have answers to the exercises in the book.
- Tic-Tac-Toe
- Figure 2.1: An exemplary bandit problem from the 10-armed testbed
- Figure 2.2: Average performance of epsilon-greedy action-value methods on the 10-armed testbed
- Figure 2.3: Optimistic initial action-value estimates
- Figure 2.4: Average performance of UCB action selection on the 10-armed testbed
- Figure 2.5: Average performance of the gradient bandit algorithm
- Figure 2.6: A parameter study of the various bandit algorithms
- Figure 4.1: Convergence of iterative policy evaluation on a small gridworld
- Figure 4.2: Jack’s car rental problem
- Figure 4.3: The solution to the gambler’s problem
- Figure 5.1: Approximate state-value functions for the blackjack policy
- Figure 5.2: The optimal policy and state-value function for blackjack found by Monte Carlo ES
- Figure 5.3: Weighted importance sampling
- Figure 5.4: Ordinary importance sampling with surprisingly unstable estimates
- Example 6.2: Random walk
- Figure 6.2: Batch updating
- Figure 6.3: Sarsa applied to windy grid world
- Figure 6.4: The cliff-walking task
- Figure 6.6: Interim and asymptotic performance of TD control methods
- Figure 6.7: Comparison of Q-learning and Double Q-learning
- Figure 8.2: Average learning curves for Dyna-Q agents varying in their number of planning steps
- Figure 8.4: Average performance of Dyna agents on a blocking task
- Figure 8.5: Average performance of Dyna agents on a shortcut task
- Example 8.4: Prioritized sweeping significantly shortens learning time on the Dyna maze task
- Figure 8.7: Comparison of efficiency of expected and sample updates
- Figure 8.8: Relative efficiency of different update distributions
- Figure 9.1: Gradient Monte Carlo algorithm on the 1000-state random walk task
- Figure 9.2: Semi-gradient n-step TD algorithm on the 1000-state random walk task
- Figure 9.5: Fourier basis vs polynomials on the 1000-state random walk task
- Figure 9.8: Example of feature width’s effect on initial generalization and asymptotic accuracy
- Figure 9.10: Single tiling and multiple tilings on the 1000-state random walk task
- Figure 10.1: The cost-to-go function for the Mountain Car task in one run
- Figure 10.2: Learning curves for semi-gradient Sarsa on the Mountain Car task
- Figure 10.3: One-step vs multi-step performance of semi-gradient Sarsa on the Mountain Car task
- Figure 10.4: Effect of α and n on early performance of n-step semi-gradient Sarsa
- Figure 10.5: Differential semi-gradient Sarsa on the access-control queuing task
- Figure 11.2: Baird's Counterexample
- Figure 11.6: The behavior of the TDC algorithm on Baird’s counterexample
- Figure 11.7: The behavior of the ETD algorithm in expectation on Baird’s counterexample
- Figure 12.3: Off-line λ-return algorithm on 19-state random walk
- Figure 12.6: TD(λ) algorithm on 19-state random walk
- Figure 12.8: True online TD(λ) algorithm on 19-state random walk
- Figure 12.10: Sarsa(λ) with replacing traces on Mountain Car
- Figure 12.11: Summary comparison of Sarsa(λ) algorithms on Mountain Car
- Example 13.1: Short corridor with switched actions
- Figure 13.1: REINFORCE on the short-corridor grid world
- Figure 13.2: REINFORCE with baseline on the short-corridor grid-world
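To give a flavor of what the replications look like, here is a minimal, self-contained sketch of the ε-greedy experiment behind Figure 2.2 (this is an illustrative simplification, not the actual file in this repository): a stationary 10-armed Gaussian testbed with sample-average value estimates, averaged over independent runs.

```python
import numpy as np

def run_bandit(epsilon, runs=200, steps=1000, k=10, seed=0):
    """Average reward per step of an epsilon-greedy agent on the 10-armed testbed."""
    rng = np.random.default_rng(seed)
    rewards = np.zeros(steps)
    for _ in range(runs):
        q_true = rng.normal(0.0, 1.0, k)   # true action values q*(a) ~ N(0, 1)
        q_est = np.zeros(k)                # sample-average estimates
        counts = np.zeros(k)
        for t in range(steps):
            if rng.random() < epsilon:
                a = int(rng.integers(k))       # explore: random action
            else:
                a = int(np.argmax(q_est))      # exploit: greedy action
            r = rng.normal(q_true[a], 1.0)     # reward ~ N(q*(a), 1)
            counts[a] += 1
            q_est[a] += (r - q_est[a]) / counts[a]  # incremental sample average
            rewards[t] += r
    return rewards / runs
```

Plotting `run_bandit(0.1)` and `run_bandit(0.0)` against the step index reproduces the qualitative result of Figure 2.2: with enough steps, a small amount of exploration outperforms the pure greedy method.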
All files are self-contained. To reproduce a figure, run the corresponding file directly:

    python any_file_you_want.py
If you want to contribute missing examples or fix bugs, feel free to open an issue or make a pull request.
The following figures/examples are still missing:
- Figure 12.14: The effect of λ