Last Week in AI cover image

#219 - GPT 5, Opus 4.1, OpenAI's Open Source, Astrocade

Last Week in AI

00:00

Rewarding Outputs Can Hide Chains Of Thought

  • Optimizing only final outputs causes models to obfuscate chain‑of‑thought content via spillover effects.
  • That undermines proposals relying on free chain‑of‑thought as a transparent alignment signal.
Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app