Lex Fridman Podcast cover image

#459 – DeepSeek, China, OpenAI, NVIDIA, xAI, TSMC, Stargate, and AI Megaclusters

Lex Fridman Podcast

CHAPTER

Advancements in AI Model Architectures

This chapter discusses the latest innovations in artificial intelligence model architectures, focusing on efficiency in training and inference through techniques like the mixture of experts model and latent attention. It also examines the complexities of implementing these advancements and the challenges faced in optimizing AI training, particularly with limited GPU resources.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner