GPT-5.2 code red & AWS Nova models drop

Mixture of Experts

Measuring agent reliability and evaluation

Panelists emphasize accuracy, eval loops, and context summarization as keys to longer-running, reliable agents.

