eeshadutta / Markov-Decision-Process

MDPs solved using Value Iteration and Linear Programming

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Markov Decision Process

A Markov decision process (MDP) is a discrete time stochastic control process. It provides a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker. MDPs are useful for studying optimization problems solved via dynamic programming and reinforcement learning.

There are three methods to solve MDPs-

  • Value Iteration
  • Policy Iteration
  • Linear Programming

This assignment uses Value Iteration and Linear Programming to solve MDP.

About

MDPs solved using Value Iteration and Linear Programming


Languages

Language:Python 100.0%