uclaml / POWERS

Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

POWERS

Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs

About

Near-optimal Policy Optimization Algorithms for Learning Adversarial Linear Mixture MDPs

License:Apache License 2.0


Languages

Language:Jupyter Notebook 82.5%Language:Python 17.5%