Deep and Reinforcement Learning UPC Reinforcement Learning and Deep Learning LAB MODEL FREE SARSA, Q-LEARNING, E-SARSA, DOUBLE Q-LEARNING MDP AND DYNAMIC PROGRAMING Multi_Armed_Bandit.ipynb MAB: EXPLORATION AND EXPLOITATION. (Greedy | UCB) Generative Adversarial Networks Transfer Learning