Interconnects cover image

Interviewing Tri Dao and Michael Poli of Together AI on the future of LLM architectures

Interconnects

00:00

Advancements in Machine Learning Architectures

This chapter explores the cutting-edge developments in machine learning, particularly the interplay between linear attention mechanisms and state space models. It discusses the implications of innovative architectural choices, hybridization of model components, and the challenges of implementing new kuda kernels in GPU systems. The conversation also covers the future of model architectures, especially in light of the ongoing debate on the dominance of Transformer models versus emerging alternatives.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app