Machine Learning Street Talk (MLST) cover image

#030 Multi-Armed Bandits and Pure-Exploration (Wouter M. Koolen)

Machine Learning Street Talk (MLST)

00:00

Exploring Monte Carlo vs. Las Vegas Algorithms in Reinforcement Learning

This chapter examines the distinctions between Monte Carlo and Las Vegas algorithms within reinforcement learning, using AlphaGo as a case study. It highlights the balance between exploration and exploitation in decision-making scenarios and discusses the challenges in evaluating game states effectively. Additionally, the conversation introduces innovative concepts, such as the 'pure exploration compiler,' aimed at simplifying research methodologies and optimizing sampling strategies.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app