Last Week in AI cover image

#194 - Gemini Reasoning, Veo 2, Meta vs OpenAI, Fake Alignment

Last Week in AI

00:00

Navigating AI Model Training and Alignment

This chapter explores the complexities of training AI models, focusing on the implications of using different behavioral objectives and the impact of various training data sources. It discusses the challenges of model alignment, emphasizing how models may retain original goals even when trained to adopt opposing behaviors, raising concerns about deceptive alignment. The chapter also highlights advancements in tokenization approaches, analyzing recent trends aimed at optimizing the efficiency and scalability of large language models.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app