filippogiruzzi / reinforcement_learning_resources

Personal Deep Reinforcement Learning class notes

reinforcement-learning deep-reinforcement-learning uc-berkeley cs285 reinforcement-learning-algorithms deep-learning artificial-intelligence deep-neural-networks deeplearning control-theory imitation-learning behaviour-cloning actor-critic q-learning deep-q-learning deep-q-network deep-q-learning-network policy-gradient model-based-reinforcement-learning

Reinforcement Learning resources

Keywords: Deep Reinforcement Learning, UC Berkeley

This repository contains some personal Deep Reinforcement Learning class notes in .pdf format.

Table of contents

Class notes content
1.1 CS285 (UC Berkeley) - Deep Reinforcement Learning by Sergey Levine
Resources

1. Class notes content

1.1 CS285 (UC Berkeley) - Deep Reinforcement Learning by Sergey Levine

These notes summarize the main Reinforcement Learning algorithms, both in theory and in practice with some tips & hacks for efficient implementation.

1. Introduction

WIP

2. Supervised Learning of behaviors

Goal
Algorithms
2.1 DAgger: Dataset Aggregation
Tips & hacks

3. Introduction to Reinforcement Learning

Goal
Algorithms
2.1 Global structure
2.2 Exemples

4. Policy Gradients

Algorithms
1.1. REINFORCE
Tips & hacks

5. Actor-Critic algorithms

Algorithms
1.1. Batch Actor-Critic
1.2. Online Actor-Critic
Tips & hacks

6. Value function methods

Algorithms
1.1. Policy iteration
1.2. Policy iteration with Dynamic programming
1.3. Value iteration
1.4. Fitted Value iteration
1.5. Fitted Q-iteration
1.6. Online Q-iteration
Tips & hacks

7. Deep Reinforcement Learning with Q-functions

Algorithms
1.1. Q-learning with replay buffer
1.2. Q-learning with replay buffer and target network
1.3. DQN: classic Deep Q-learning
1.4. DDPG: Q-learning for continuous actions
Tips & hacks

8. Advanced Policy Gradients

WIP

9. Model-based planning

WIP

10. Model-based Reinforcement Learning

Algorithms
1.1. Model-based Reinforcement Learning version 0.5
1.2. Model-based Reinforcement Learning version 1.0
1.3. Model-based Reinforcement Learning version 1.5
1.4. Model-based Reinforcement Learning with latent space models
Tips & hacks

11. Model-based Policy Learning

Algorithms
1.1. Model-based Reinforcement Learning version 2.0
1.2. DYNA: online Q-learning model-free Reinforcement Learning with a model
1.3. General DYNA-style model-based Reinforcement Learning
1.4. MBA: Model-based Acceleration – MVE: Model-based Value Ex- pansion – MBPO: Model-based Policy Optimization
1.5. Divide and Conquer Reinforcement Learning
Tips & hacks

12. Variational Inference & Generative models

WIP

13. Control as inference

Algorithms
1.1 Soft Q-learning

14. Inverse Reinforcement Learning

Algorithms
1.1. Maximum Entropy Inverse Reinforcement Learning
Tips & hacks

15. Transfer & Multi-task Learning

Tips & hacks

16. Distributed Reinforcement Learning

WIP

17. Exploration

Algorithms
1.1. Pre-train & finetune
Tips & hacks

18. Meta-learning

WIP

19. Information theory

WIP

2. Resources

These notes were widely inpspired by:

CS285 (UC Berkeley) - Deep Reinforcement Learning

About

Personal Deep Reinforcement Learning class notes

reinforcement-learning deep-reinforcement-learning uc-berkeley cs285 reinforcement-learning-algorithms deep-learning artificial-intelligence deep-neural-networks deeplearning control-theory imitation-learning behaviour-cloning actor-critic q-learning deep-q-learning deep-q-network deep-q-learning-network policy-gradient model-based-reinforcement-learning