Latent Space: The AI Engineer Podcast cover image

2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS]

Latent Space: The AI Engineer Podcast

00:00

Exploring the Efficiency and Challenges of Advanced AI Architectures

This chapter discusses the complexities of training large AI models, with a specific focus on VRAM limitations and the efficiency of smaller models. It also looks into the future potential of models to manage infinite context lengths and the importance of developing suitable benchmarks for performance assessment.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app