3min chapter

MLOps Architect - Danny Leybzon

DataTalks.Club

CHAPTER

Optimization Theory - Exploring the Search Space and Exploiting Full Time?

The explore-exploit trade-off from optimization theory is used in reinforcement learning. It says that if you have a bounded amount of time in which to optimize over a search space, your best strategy is to start off very exploratory and then take advantage of what you have learned. Thompson sampling is one of the simplest and most effective solutions to the multi-armed bandit problem. People follow a similar pattern: we become more exploitative as we get older.
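As a rough illustration of how Thompson sampling moves from exploring to exploiting, here is a minimal sketch for a Bernoulli multi-armed bandit. It is not taken from the episode: the function name `thompson_sampling`, the uniform Beta(1, 1) prior, and the example payout rates are all assumptions chosen for illustration.

```python
import random


def thompson_sampling(true_rates, n_rounds, seed=0):
    """Simulate Thompson sampling on a Bernoulli bandit.

    true_rates: hypothetical payout probability of each arm (unknown to the agent).
    Returns the number of times each arm was pulled.
    """
    rng = random.Random(seed)
    n_arms = len(true_rates)
    successes = [0] * n_arms  # observed rewards of 1 per arm
    failures = [0] * n_arms   # observed rewards of 0 per arm
    pulls = [0] * n_arms

    for _ in range(n_rounds):
        # Draw one plausible payout rate per arm from its Beta posterior.
        # Assumption: a uniform Beta(1, 1) prior on every arm.
        samples = [
            rng.betavariate(successes[a] + 1, failures[a] + 1)
            for a in range(n_arms)
        ]
        # Play the arm whose sampled rate is highest.
        arm = max(range(n_arms), key=lambda a: samples[a])

        # Simulate the reward and update that arm's posterior counts.
        if rng.random() < true_rates[arm]:
            successes[arm] += 1
        else:
            failures[arm] += 1
        pulls[arm] += 1

    return pulls


if __name__ == "__main__":
    # Hypothetical arms: the second one pays out far more often.
    print(thompson_sampling([0.1, 0.9], n_rounds=2000, seed=1))
```

Early on, the posteriors are wide, so the sampled rates vary a lot and every arm gets tried (exploration). As evidence accumulates, the posteriors narrow and the arm with the best observed record wins the draw most of the time (exploitation).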
