Slide 1
REINFORCEMENT LEARNING

Slide 2
AGENDA
Online learning
Reinforcement learning
Model-free vs. model-based
Passive vs. active learning
Exploration-exploitation tradeoff…