wpfvs

execute

topics we have to or could cover

What our program does and benchmark analysis (how well we improved using the cluster)
The programs architecture
Our source code (listing for the documentation)
Tensorflow
Python
OpenAI Gym
Neural Nets
(RabbitMQ)
(Numpy) (fast linear algebra in python)
...

openai gym for training a RL agent.
distributed multi-layered ai cluster architecture for a distributed system.
awesome-mashine-learning a lot of reading material.
awesome-reinforcement-learning for stuff about reinforcement learning.
multi-agent system theoretical model of something we could build.
deep q learning holy sh*t mathematical but basically describes how our agent works and how to optimize it further.
double deep q learining
time to beat LunarLander-v2 22 minutes.

wpf vs 2018

Language:Python 100.0%