Latent Space: The AI Engineer Podcast cover image

Latent Space: The AI Engineer Podcast

Making Transformers Sing - with Mikey Shulman of Suno

Mar 14, 2024
Mikey Shulman, CEO and co-founder of the music generation startup Suno, shares his journey from finance to creating innovative AI-driven audio experiences. The discussion dives into the fascinating challenges of transforming text into music and the unique complexities tied to audio creation. They explore the balance between accessibility and artistry, the emotional depth AI can express, and even compose a humorous country tune about cloud computing hurdles. Shulman also highlights the evolving role of AI in music sampling and audience participation.
52:51

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Voice AI progress from formant synthesis to neural networks, enabling easy and nuanced speech generation.
  • Transformers applied to music generation, predicting music sequences akin to text tokens for innovative music creation.

Deep dives

Innovative Music Generation Approach with AI Models

The podcast episode delves into a conversation with Mikey Schumann about Suno, a music generation startup that has made waves in the industry. Mikey's background in physics and AI startups like Kensho adds depth to the discussion. Suno's unique approach to music generation involves using AI models similar to text transformers to predict the next tokens in music sequences, highlighting the evolving field of audio technology.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner