Unity ML-Agents Toolkit

(latest release) (all releases)

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents. We provide implementations (based on PyTorch) of state-of-the-art algorithms to enable game developers and hobbyists to easily train intelligent agents for 2D, 3D and VR/AR games. Researchers can also use the provided simple-to-use Python API to train Agents using reinforcement learning, imitation learning, neuroevolution, or any other methods. These trained agents can be used for multiple purposes, including controlling NPC behavior (in a variety of settings such as multi-agent and adversarial), automated testing of game builds and evaluating different game design decisions pre-release. The ML-Agents Toolkit is mutually beneficial for both game developers and AI researchers as it provides a central platform where advances in AI can be evaluated on Unity’s rich environments and then made accessible to the wider research and game developer communities.

프로젝트 Description

Unity에서 제공한 ml-agent 라이브러리의 PPO 알고리즘을, Python-API에서 구현한 DQN 알고리즘으로 성능(mean reward)을 재현

0. 환경 설정

패키지 다운로드

conda install -y python==3.8
pip install mlagents==0.26.0

conda install cuda -c nvidia
pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu116

Unity에서 Build한 환경 다운로드

동작 OS : Window
다운로드 주소 : https://drive.google.com/file/d/1kEUlAdeUz5sU4W6UWxVpUMv2bGJEmgNt/view?usp=sharing
다운로드한 파일을 해당 repository({경로}/RLWithUnity/)에 압축해제

1. ml-agent실행

해당 Repository clone
프로젝트 설치 후 강화학습을 위한 유니티 환경 Build(PPT 참고)
Build한 유니티 환경에 대한 Model 학습 진행

# PPO(Proximal Policy Optimization)
mlagents-learn {trainer_path} --env={env_path}/{build_name} --run-id={run_id}

mlagents-learn config/ppo/3DBall.yaml --env=./project_unity_hub/ball3d/ball3d_env/ball3d.exe --run-id=rl_project

-- PPO algorithm Hyperparameter에 관한 사항은 ./config/ppo/3DBall.yaml

build한 환경 확인

실행
- 학습 전 : ./RLWithUnity/project_unity_hub/ball3d/ball3d_env/ball3d.exe 실행
- PPO 학습 후 : ./RLWithUnity/project_unity_hub/ball3d_ppo_training_result/ppo/ppo_env/ppo.exe 실행

2. Python-API DQN 실행

# DQN 학습
python RL_Trainer_dqn.py

# DQN 학습 결과 Test
python RL_Tester_dqn.py  # 단 RL_Tester_dqn에서 model_load 경로를 수정해주어야 함.

References

folk ml-agents @ release_17

About

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

https://unity.com/products/machine-learning-agents

Other

Languages

Language:C# 55.1%Language:Python 42.2%Language:Jupyter Notebook 2.3%Language:ShaderLab 0.3%Language:Shell 0.1%Language:Batchfile 0.1%Language:Dockerfile 0.0%Language:C 0.0%