

Speeding up Generative AI models | Luis Ceze, cofounder and CEO of OctoML
Luis Ceze is the cofounder and CEO of OctoML, a platform that offers compute infrastructure to fine-tune, run, and scale your AI models. He's a professor at University of Washington and a venture partner at Madrona. He was previously the cofounder of Corensic. He has a PhD in Computer Science from University of Illinois Urbana-Champaign.
In this episode, we cover a range of topics including:
- OctoAI product announcement
- How to make LLMs faster and cheaper
- Training your own LLMs
- The perceived shortage of AI compute
- Enterprise spend on AI compute
- Applications being built using OctoML
- Domain specific models
Luis's favorite books:
- Thinking, Fast and Slow (Author: Daniel Kahneman)
- Blindness (Author: Jose Saramago)
--------
Where to find Prateek Joshi:
Newsletter: https://prateekjoshi.substack.com
Website: https://prateekj.com
LinkedIn: https://www.linkedin.com/in/prateek-joshi-91047b19
Twitter: https://twitter.com/prateekvjoshi