Latent Space: The AI Engineer Podcast

97% Cheaper, Faster, Better, Correct AI — with Varun Mohan of Codeium

Mar 2, 2023
Varun Mohan, CEO of Codeium and former tech lead at Nuro, dives deep into the evolving landscape of AI infrastructure. He shares insights on achieving GPU efficiency and dynamic multiplexing for optimal model performance. The discussion tackles the significance of robust infrastructure in model development and the impact of AI in code generation, noting that 60-70% of AI-generated code is retained. Varun also explores the challenges of scaling AI applications and the transformative potential of AI in both software and legal industries.
50:52

Podcast summary created with Snipd AI

Quick takeaways

  • Exafunction optimizes GPU utilization for cost-effective deep learning workloads; one customer cut its costs by 97%.
  • Codeium, developed by Exafunction, offers free code completion that generates suggestions in real time, and has attracted over 10,000 users.

Deep dives

Exafunction: Making Deep Learning Infrastructure Easier

Exafunction was founded to address the challenges of building and maintaining deep learning infrastructure. The company focuses on GPU utilization and virtualization so that deep learning workloads run efficiently and cost-effectively. By decoupling GPUs from workloads, Exafunction lets companies run deep learning models at scale without breaking the bank. It has helped companies raise GPU utilization and reduce costs significantly, with one customer cutting costs by 97% by optimizing GPU usage. Exafunction's expertise lies in finding the right ML architecture for each use case; most enterprises are well served by fine-tuning off-the-shelf models such as BERT for language tasks or ResNet for vision applications. The company believes the future of code generation lies in striking the right balance between latency, quality, and correctability, and it is dedicated to building products that offer clear ROI and value to professionals.
