Machine Learning Street Talk (MLST) cover image

#030 Multi-Armed Bandits and Pure-Exploration (Wouter M. Koolen)

Machine Learning Street Talk (MLST)

00:00

Defining Contracts in Multi-Armed Bandit Problems

This chapter explores the complexities of establishing a minimal contract for multi-armed bandit problems through a domain-specific language tailored to express assumptions and conditions. It incorporates parametric assumptions, discusses the Kuhlbeck-Leibler divergence for generalization, and details how to formulate queries for optimal arm selection using linear inequalities.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app