Interconnects cover image

Interviewing Tri Dao and Michael Poli of Together AI on the future of LLM architectures

Interconnects

00:00

Advancements in Machine Learning Architectures

This chapter explores the cutting-edge developments in machine learning, particularly the interplay between linear attention mechanisms and state space models. It discusses the implications of innovative architectural choices, hybridization of model components, and the challenges of implementing new kuda kernels in GPU systems. The conversation also covers the future of model architectures, especially in light of the ongoing debate on the dominance of Transformer models versus emerging alternatives.

Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner