No Priors: Artificial Intelligence | Technology | Startups

State Space Models and Real-time Intelligence with Karan Goel and Albert Gu from Cartesia

Jun 27, 2024
Karan Goel, co-founder of Cartesia and inventor of State Space Models at Stanford AI Lab, joins forces with fellow co-founder Albert Gu to discuss groundbreaking advancements in real-time intelligence. They dive into their product Sonic, a text-to-speech engine boasting unparalleled speed and quality. The duo contrasts State Space Models with traditional transformers, emphasizing the efficiency of their innovations. Listeners will also enjoy insights on the future of intelligent systems, emotional speech tech, and the aesthetics of tackling research challenges.
34:08

Podcast summary created with Snipd AI

Quick takeaways

  • Sonic by Cartesia revolutionizes text-to-speech with ultra-low latency, enhancing interactive gaming experiences.
  • State-space models (SSMs) outperform Transformer architectures in efficiency and adaptability, offering fundamental new primitives for diverse applications.

Deep dives

Revolutionizing Interactive Text-to-Speech with Sonic Engine

Cartesia's founders, Karan Goel and Albert Gu, introduced Sonic, a high-speed text-to-speech engine built for interactive, low-latency generation. Its applications in gaming, where millions of players can interact with characters and voice agents, illustrate its potential impact. The team aims to keep reducing Sonic's latency, with a focus on improving the user experience in gaming and voice technologies.
