Latent Space: The AI Engineer Podcast cover image

2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS]

Latent Space: The AI Engineer Podcast

CHAPTER

Exploring the Efficiency and Challenges of Advanced AI Architectures

This chapter discusses the complexities of training large AI models, with a specific focus on VRAM limitations and the efficiency of smaller models. It also looks into the future potential of models to manage infinite context lengths and the importance of developing suitable benchmarks for performance assessment.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner