Generally Intelligent cover image

Episode 33: Tri Dao, Stanford: On FlashAttention and sparsity, quantization, and efficient inference

Generally Intelligent

00:00

Introduction

TreeDow is a PhD student at Stanford code buys by Stefano Erman and Chris Ray. He'll be joining Princeton as an assistant professor next year. TreeDow: I think there are many paths to a high-performing language models. We hope you learn as much as we have in our quest to understand and build the mind.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app