akaAlbo / deeprlbootcamp

Solution to the Deep RL Bootcamp labs from UC Berkeley

Home Page:https://sites.google.com/view/deep-rl-bootcamp/home

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Solutions to the Deep RL Bootcamp labs

  • Prelab: Set up your computer for all labs.
  • Lab 1: Markov Decision Processes. You will implement value iteration, policy iteration, and tabular Q-learning and apply these algorithms to simple environments including tabular maze navigation (FrozenLake) and controlling a simple crawler robot.
  • Lab 2: Introduction to Chainer. You will implement deep supervised learning using Chainer, and apply it to the MNIST dataset.
  • Lab 3: Deep Q-Learning. You will implement the DQN algorithm and apply it to Atari games.
  • Lab 4: Policy Optimization Algorithms. You will implement various policy optimization algorithms, including policy gradient, natural policy gradient, trust-region policy optimization (TRPO), and asynchronous advantage actor-critic (A3C). You will apply these algorithms to classic control tasks, Atari games, and roboschool locomotion environments.

About

Solution to the Deep RL Bootcamp labs from UC Berkeley

https://sites.google.com/view/deep-rl-bootcamp/home


Languages

Language:Jupyter Notebook 46.3%Language:Python 41.3%Language:JavaScript 6.5%Language:HTML 2.7%Language:CSS 1.9%Language:Shell 1.4%