BLEE's repositories
Language:HTML000
CS131-Programming-Language
CS131-Ocaml-hw1
Language:OCaml000
Language:C000
239A-Reinforcement-Learning
This project investigates the intuitions/ideas behind Double DQN, and evaluate how much it can improve Q-value overestimation and agent performance. We aim to describe how the learning/update process in Double DQN ends up with better Q-value estimates and agent performance when comparing to that of DQN.
Language:MATLAB000