holoiii / cfr

Counterfactual Regret Minimization

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

CFR: Counterfactual Regret Minimization

License

Python library implementing various counterfactual regret minimization algorithms for imperfect information games.

Disclaimer

This library is currently in a pre-alpha stage. All code is subject to change without prior notice.

Roadmap

The aim is to make a fully generic self-play algorithm that can be configured with the following parameters

  • Update methods:
    • CFR
    • CFR+
    • Fictitious play
    • Hedge
  • Graph methods:
    • Full width ("vanilla")
    • Chance sampling ("Monte Carlo")
  • Optimizations:
    • Fixed strategy iteration
    • Pruning
  • Games:
    • Rock-Paper-Scissors
    • Liar Die
    • (optional, no promises!) any game in Gambit EFG format through its PyAPI
    • (long-term goal) Stratego
Acknowledgments

This library was inspired by Todd W. Neller's and Marc Lanctot's lecture notes "An Introduction to Counterfactual Regret Minimization". Their lecture notes, slides and original Java implementations for the games of Rock-Paper-Scissors, Kuhn poker, Dudo and Liar Die have been added to this repository for archival purposes.

License

Copyright Rein Halbersma 2018.
Distributed under the Boost Software License, Version 1.0.
(See accompanying file LICENSE_1_0.txt or copy at http://www.boost.org/LICENSE_1_0.txt)

About

Counterfactual Regret Minimization

License:Boost Software License 1.0


Languages

Language:Jupyter Notebook 79.5%Language:Java 12.6%Language:Python 7.8%