Latent Space: The AI Engineer Podcast cover image

2024 in Post-Transformers Architectures (State Space Models, RWKV) [LS Live @ NeurIPS]

Latent Space: The AI Engineer Podcast

00:00

Innovations in CUDA Libraries and Next-Gen Model Architectures

This chapter introduces the CUDA library Thunder Kittens, focusing on its role in simplifying matrix operations crucial for model design on H100 hardware. It also delves into the advancement of next-gen models in video generation and context management, exploring optimization strategies for real-time performance.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app