AI Safety Newsletter cover image

AISN #50: AI Action Plan Responses

AI Safety Newsletter

00:00

Navigating Proxy Gaming in AI Reasoning Models

This chapter explores the complexities of proxy gaming within AI reasoning models that use reinforcement learning. It highlights the trade-offs of applying optimization to COTS systems, the need for transparent thought processes to uncover misbehavior, and updates on relevant AI and regulatory advancements.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app