Latent Space: The AI Engineer Podcast cover image

LLMs Everywhere: Running 70B models in browsers and iPhones using MLC — with Tianqi Chen of CMU / OctoML

Latent Space: The AI Engineer Podcast

00:00

From OctoMail to OctoAI: Navigating AI Challenges

This chapter explores the transformation of OctoMail into OctoAI, focusing on optimizing model runtimes and easing deployment hurdles. It discusses the shift from custom models to pre-trained generative ones, highlighting the engineering challenges of scalability and integration faced by enterprises. The conversation also touches on the future of AI, emphasizing continuous learning and the interplay between model optimization, algorithms, and data in evolving AI applications.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app