Utilizing Model Specific Chips for Faster Response Times | 34sec snip from Invest Like the Best with Patrick O'Shaughnessy

Get the app

Gavin Uberti - Real-Time AI & The Future of AI Hardware - [Invest Like the Best, EP.356]

Invest Like the Best with Patrick O'Shaughnessy

chevron_right

notes

NOTE

Utilizing Model Specific Chips for Faster Response Times

Utilizing model specific chips, specifically designed for transformer models, can significantly reduce initial delay by increasing compute capacity and efficient utilization. By embedding these models into chips with high memory reading efficiency, it is possible to achieve over 90% utilization and drastically reduce response times from milliseconds to seconds.

00:00

Transcript

chevron_right

Play full episode

chevron_right

Transcript

Episode notes

Today, my guest is 21-year-old Gavin Uberti, who dropped out of Harvard to build Etched, which is one of the most fascinating companies I’ve seen. The topic of our conversation is the ongoing revolution in artificial intelligence, and more specifically the chips and technology that powers these incredible models. To date, general-purpose AI chips like Nvidia GPUs have powered the revolution, but Gavin’s bet is that purpose-built chips, hard-coded for the underlying model architecture, will dramatically reduce the latency and cost of running models like GPT4. We’re about to embark on what Gavin calls the “largest infrastructure buildout since the industrial revolution”, and I won’t spoil what he thinks this will unlock for all of us. It is so uplifting to me that someone so young can be working on something so big. Please enjoy this great conversation with Gavin Uberti.

Check out Etched.AI

Listen to Founders Podcast

For the full show notes, transcript, and links to mentioned content, check out the episode page here.

-----

This episode is brought to you by Tegus. Tegus is the modern research platform for leading investors, and provider of Canalyst. Tired of calculating fully diluted shares outstanding? Access every publicly-reported data point and industry-specific KPI through their database of over 4,000 drivable global models hand-built by a team of sector-focused analysts, 35+ industry comp sheets, and Excel add-ins that let you use their industry-leading data in your own spreadsheets. Tegus’ models automatically update each quarter, including hard-to-calculate KPIs like stock-based compensation and organic growth rates, empowering investors to bypass the friction of sourcing, building, and updating models. Make efficiency your competitive advantage and take back your time today. As a listener, you can trial Canalyst by Tegus for free by visiting tegus.co/patrick.

-----

Invest Like the Best is a property of Colossus, LLC. For more episodes of Invest Like the Best, visit joincolossus.com/episodes.

Past guests include Tobi Lutke, Kevin Systrom, Mike Krieger, John Collison, Kat Cole, Marc Andreessen, Matthew Ball, Bill Gurley, Anu Hariharan, Ben Thompson, and many more.

Stay up to date on all our podcasts by signing up to Colossus Weekly, our quick dive every Sunday highlighting the top business and investing concepts from our podcasts and the best of what we read that week. Sign up here.

Show Notes:

(00:03:41) - (first question) - Born too late to explore the world, too early to explore the stars

(00:05:59) - Interpreting and defining superintelligence

(00:07:20) - Excitement we can have for an AI driven future

(00:09:25) - Overview and basic terminology of the transformers that power AI

(00:15:53) - What Q* is and the rumors around it

(00:20:41) - Robotics, machinery, and what’s interesting about them

(00:23:18) - The problem of latency and computing power

(00:8:55) - Needing to build physical infrastructure that doesn’t exist

(00:32:18) - Inference and training AI models

(00:36:00) - Major stages of chip design and the upper limits of speed

(00:45:56) - Customers for billion dollar generative AI models

(00:48:56) - A sci-fi-esque reality and the politicization of AI

(00:50:38) - The Bitter Lesson and the implications of powerful AI models

(00:56:27) - The most important companies in the AI space today

(00:61:52) - Strategically building a defensible AI product

(01:04:07) - Software development and why other AI companies fail

(01:06:51) - Specialization and chip performance improvement

(01:15:34) - Why the transformer remains the leading architecture

(01:17:26) - A proliferation of models beyond the major players and data access

(01:21:19) - The kindest thing anyone has ever done for Gavin

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.

Home Top podcasts Popular guests