In July 2020, I initiated a reading group with my advisor to study the area of multi-armed bandits. The discussions are largely adapted from the book Bandit Algorithms, which was recently made available online. Written notes for the meetings are provided below.

  • Lecture 1: Introduction to stochastic finite-armed bandits, the explore-then-commit strategy, and UCB (a minimal sketch of UCB appears after this list).

  • Lecture 2: Asymptotic optimality of UCB, MOSS, adversarial bandits, and the Exp3 algorithm.

  • Lecture 3: The Exp3-IX algorithm.

  • Lecture 4: Contextual bandits, bandits with expert advice, and the Exp4 algorithm.

  • Lecture 5: Stochastic linear bandits and LinUCB.

  • Lecture 6: Notebook implementing the algorithms (in preparation).

  • Lecture 7: Bandit PCA.
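
To give a concrete feel for the material in Lecture 1, here is a minimal Python sketch of the UCB1 index policy on a Bernoulli bandit. The arm means, horizon, and the helper name `ucb1` are illustrative choices of mine, not taken from the notes or the book.

```python
import math
import random

def ucb1(arm_means, horizon, seed=0):
    """Run UCB1 on a Bernoulli bandit with the given arm means
    (unknown to the learner); return the cumulative regret."""
    rng = random.Random(seed)
    k = len(arm_means)
    counts = [0] * k      # number of pulls per arm
    sums = [0.0] * k      # cumulative reward per arm
    best_mean = max(arm_means)
    regret = 0.0

    for t in range(1, horizon + 1):
        if t <= k:
            arm = t - 1   # pull each arm once to initialize
        else:
            # Choose the arm maximizing empirical mean + exploration bonus.
            arm = max(
                range(k),
                key=lambda i: sums[i] / counts[i]
                + math.sqrt(2 * math.log(t) / counts[i]),
            )
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        sums[arm] += reward
        regret += best_mean - arm_means[arm]
    return regret

if __name__ == "__main__":
    # Illustrative two-armed instance: regret should grow only
    # logarithmically in the horizon.
    print(ucb1([0.5, 0.6], horizon=10_000))
```

The exploration bonus sqrt(2 log t / n_i) shrinks as an arm is pulled more often, so the policy balances trying under-explored arms against exploiting the empirically best one.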