Get the app
Albert Gu
Assistant professor at Carnegie Mellon University researching post-transformer architectures for multi-modal foundation models.
Best podcasts with Albert Gu
Ranked by the Snipd community
20 snips
Jul 17, 2024
• 58min
Mamba, Mamba-2 and Post-Transformer Architectures for Generative AI with Albert Gu - #693
chevron_right
In this discussion, Albert Gu, an assistant professor at Carnegie Mellon University, dives into his research on post-transformer architectures. He explains the efficiency and challenges of the attention mechanism, particularly in managing high-resolution data. The conversation highlights the significance of tokenization in enhancing model effectiveness. Gu also explores hybrid models that blend attention with state-space elements and emphasizes the groundbreaking advancements brought by his Mamba and Mamba-2 frameworks. His vision for the future of multi-modal foundation models is both insightful and inspiring.
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
Get the app