Language Modeling With State Space Models with Dan Fu - #630

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Challenges and Innovations in Language Modeling

This chapter explores the difficulties of using state space models for language tasks, emphasizing their limitations during training and inference. It highlights solutions such as stacking state space models to improve performance, and compares their effectiveness with that of traditional transformer models. The discussion also covers the development of generative models and how they measure up against established benchmarks, noting their strengths and weaknesses.
