mwalton / ToM-hanabi-neurips19

Implementation of nested theory of mind belief estimation & implicit communication intrinsic rewards.

reinforcement-learning theory-of-mind hanabi neurips-2019 deep-reinforcement-learning

Hanabi ToM

Implementation of nested theory of mind belief estimation & implicit communication intrinsic rewards proposed in Theory of Mind for Deep Reinforcement Learning in Hanabi

Citation

@misc{fuchs2019theory,
      title={Theory of Mind for Deep Reinforcement Learning in Hanabi}, 
      author={Andrew Fuchs and Michael Walton and Theresa Chadwick and Doug Lange},
      year={2019},
      eprint={2101.09328},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

About

Implementation of nested theory of mind belief estimation & implicit communication intrinsic rewards.

reinforcement-learning theory-of-mind hanabi neurips-2019 deep-reinforcement-learning

Apache License 2.0

Languages

Language:Python 97.1%Language:Shell 2.9%