Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Neural Engine Vs GPU Trade-Offs

  • The Neural Engine exists across Apple devices but constrains model size and requires special tracing and optimizations.
  • GPUs plus unified RAM let MLX load much larger quantized models than the Neural Engine can handle.
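
As a rough illustration of why quantization matters for fitting a model in unified memory, the sketch below estimates the memory taken by model weights alone at different bit widths. The 7-billion-parameter count is a hypothetical example, not a figure from the episode, and the helper `weight_memory_gib` is made up for illustration rather than part of MLX. The estimate ignores quantization scales, activations, and the KV cache, so real usage will be higher.

```python
def weight_memory_gib(num_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB.

    Assumption: every weight is stored at the same bit width and there is
    no extra overhead (no quantization scales, activations, or KV cache).
    """
    bytes_total = num_params * bits_per_weight / 8
    return bytes_total / 2**30


if __name__ == "__main__":
    hypothetical_params = 7e9  # illustrative model size, not from the episode
    for bits in (16, 8, 4):
        size = weight_memory_gib(hypothetical_params, bits)
        print(f"{bits}-bit weights: about {size:.1f} GiB")
```

Under these assumptions, going from 16-bit to 4-bit weights cuts the weight footprint to a quarter, which is the kind of headroom that lets a larger model fit in the memory the GPU can address.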