Algorithmic and Theoretical Foundations of RL

Course Description

This course aims to provide an introduction to basic algorithmic and theoretical ingredients behind reinforcement learning, including Markov decision process, value iteration and policy iteration, Monte Carlo and temporal-difference learning, value function approximation, policy optimization, and online planning.

Main References

Reinforcement learning: an introduction by Richard S. Sutton and Andrew G. Barto, 2018
Mathematical foundation of reinforcement learning by Shiyu Zhao, 2022
Algorithms for decision making by Mikel J. Kochenderfer, Tim A. Wheeler, and Kyle H. Wray, 2022
Introduction to Reinforcement Learning by David Silver, 2015