

John Yue from Inference.ai
John Yue is the CEO and Co-Founder of Inference.ai.
Inference.ai is a leading Infrastructure as a Service company. Their mission is rooted in the belief that the intelligence of AI is directly proportional to the computational power it harnesses.
With the world's largest fleet of GPUs, they empower companies to transform their AI ambitions into reality by providing the computational muscle needed for smarter, more efficient AI.
Takeaways
- Competing with the big three (AWS, Azure and Google) and why there is need for new players.
- What it takes to build an infrastructure company and what infrastructure as a service (IaaS) means.
- A deep chat around GPUs, the space, the industry dynamics and challenges with shortages.
- Discuss why a16z is stockpiling GPUs and giving access to their portcos.
- The shortage of GPUs is expected to last for 7 to 10 years.
- We discuss the dynamics of LLMs and inferencing.
- Advantages of being in the Bay Area with an AI startup.
Chapters
00:00 Building a Better Cloud Storage and Infrastructure
04:45 Competing with AWS and Finding the Gap in the Market
07:41 The Long-Term GPU Shortage and the Need for Creative Solutions
11:21 Stability, Flexibility, and Affordability: Key Factors in Attracting Customers
18:36 The Future of the Big Three and the Rise of New Infrastructure Providers
21:04 The Benefits and Challenges of Being in Silicon Valley
25:09 Finding Advisors and Investors with AI and Hardware Expertise
30:33 Now is the Time for Canadian Founders to Start AI Companies
42:17 Journey of an Entrepreneur
Keywords
AI infrastructure, GPU shortage, inferencing, Silicon Valley, founders, AI boom