inoryy / bsc-thesis

Replicating DeepMind StarCraft II Reinforcement Learning Benchmark with Actor-Critic Methods

Bachelor’s thesis

  • Title: Replicating DeepMind StarCraft II Reinforcement Learning Benchmark with Actor-Critic Methods
  • Author: Roman Ring
  • Submission date: 8th May, 2018
  • Supervisors: Ilya Kuzovkin, Tambet Matiisen
  • Institution: Institute of Mathematics and Statistics, Faculty of Science and Technology, University of Tartu
  • Abstract: Reinforcement Learning (RL) is a subfield of Artificial Intelligence (AI) that deals with agents navigating an environment with the goal of maximizing total reward. Games are good environments for testing RL algorithms, as they have simple rules and clear reward signals. The theoretical part of this thesis explores some of the popular classical and modern RL approaches, including the use of an Artificial Neural Network (ANN) as a function approximator inside an AI agent. In the practical part of the thesis, we implement the Advantage Actor-Critic RL algorithm and replicate the ANN-based agent described in [Vinyals et al., 2017]. We reproduce the state-of-the-art results in the modern video game StarCraft II, which is considered the next milestone in AI after the fall of chess and Go.
  • Full text: Digital version
  • Implemented agent: https://github.com/inoryy/pysc2-rl-agent

[Vinyals et al., 2017] Vinyals, O., Ewalds, T., Bartunov, S., Georgiev, P., Sasha Vezhnevets, A., Yeo, M., Makhzani, A., Küttler, H., Agapiou, J., Schrittwieser, J., Quan, J., Gaffney, S., Petersen, S., Simonyan, K., Schaul, T., van Hasselt, H., Silver, D., Lillicrap, T., Calderone, K., Keet, P., Brunasso, A., Lawrence, D., Ekermo, A., Repp, J., and Tsing, R. (2017). StarCraft II: A New Challenge for Reinforcement Learning.
