The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Language Understanding and LLMs with Christopher Manning - #686

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

CHAPTER

Innovations in Language Model Alignment and Architectural Approaches

This chapter explores Direct Preference Optimization (DPO) as a simplified method for aligning language models, highlighting its resource efficiency compared to traditional reinforcement learning. It also examines new architectural concepts influenced by human learning patterns, focusing on locality and hierarchy in AI development.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner