The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) cover image

Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

00:00

Marvis: A Real-Time Local Voice Agent

  • Marvis is Prince's real-time speech agent family; first model is ~250M parameters and streams audio with ~80ms first-response latency.
  • They trained it from scratch (inspired by Sesame) and will open-source code, data, and models.
Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app