Generally Intelligent cover image

Episode 33: Tri Dao, Stanford: On FlashAttention and sparsity, quantization, and efficient inference

Generally Intelligent

CHAPTER

Introduction

TreeDow is a PhD student at Stanford code buys by Stefano Erman and Chris Ray. He'll be joining Princeton as an assistant professor next year. TreeDow: I think there are many paths to a high-performing language models. We hope you learn as much as we have in our quest to understand and build the mind.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner