A simple implementation of Deep Q-learning that uses TensorFlow 2 and DeepMind's Reverb replay buffer.
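
For readers new to the algorithm, the core of Deep Q-learning is the one-step temporal-difference target that the network is regressed toward. The sketch below is not taken from this repository; it is a minimal NumPy illustration, and the function name `dqn_targets`, its parameters, and the default `gamma` are assumptions made for the example.

```python
import numpy as np


def dqn_targets(rewards, next_q_values, dones, gamma=0.99):
    """Compute one-step TD targets for a batch of transitions.

    Hypothetical helper for illustration only:
      rewards       -- shape (batch,), reward for each transition
      next_q_values -- shape (batch, num_actions), Q-values of the next state
      dones         -- shape (batch,), 1 if the episode ended, else 0
      gamma         -- discount factor (0.99 is an assumed default)
    """
    rewards = np.asarray(rewards, dtype=np.float32)
    next_q_values = np.asarray(next_q_values, dtype=np.float32)
    dones = np.asarray(dones, dtype=np.float32)

    # Greedy value of the next state: max over actions.
    max_next_q = next_q_values.max(axis=1)

    # Terminal transitions do not bootstrap from the next state.
    return rewards + gamma * (1.0 - dones) * max_next_q
```

In a full agent, batches like these would typically be sampled from the replay buffer, and `next_q_values` would usually come from a separate target network.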