CS 285 Final Project - Fall 2023

Self-play of a PPO agent trained with full parameter sharing on our custom Chinese Checkers environment.

In this project, we analyze MARL parameter sharing techniques for multi-agent competitive environments. We create a new custom environment for Chinese checkers and train PPO agents through three different approaches:

Full parameter sharing - Agents share all parameters in the network.
Shared encoder - Agents share an encoder but have unique policy and value function heads.
Fully independent - Agents share no parameters across their networks.

About

MIT License

Languages

Language:Jupyter Notebook 82.0%Language:Python 11.7%Language:TeX 6.3%