![Data-Driven Decision Processes_hi-res logo](/sites/default/files/styles/workshop_banner_sm_1x/public/2023-03/Data-Driven%20Decision%20Processes_hi-res.png.jpg?itok=WR1qG1Qx)
Abstract
This part of the tutorial will build upon the first part of the tutorial on MDPs and focus on reinforcement learning. We will present three algorithms: TD learning, Q-Learning and Natural Policy Gradient, and outline the key ideas behind obtaining finite-time performance bounds for each of these algorithms.