Latent Space: The AI Engineer Podcast cover image

[Cognitive Revolution] The Tiny Model Revolution with Ronen Eldan and Yuanzhi Li of Microsoft Research

Latent Space: The AI Engineer Podcast

00:00

Exploring Emergence and Interpretability in Language Models

This chapter explores the complexities of emergence in artificial intelligence, focusing on how large language models learn in comparison to human children. It examines structural trade-offs in transformer models, the role of attention heads, and proposes strategies like curriculum learning to improve model training.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app