Gavin Uberti - Real-Time AI & The Future of AI Hardware - [Invest Like the Best, EP.356]

Invest Like the Best with Patrick O'Shaughnessy

Optimizing Inference and Training in Large Transformer Models

Serving large transformer models cheaply requires batching a very large number of queries together, so that the high cost of loading the weights from memory is amortized across all of them. Because batching only pays off when many users are grouped on the same hardware, inference is expected to centralize much as training has. Training is power-hungry because computing each weight's contribution to the final error means running the model backwards, which takes about double the compute of a forward pass and relies on different network primitives.
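The amortization argument in the summary can be made concrete with a small sketch. The cost model below is a simplification assumed for illustration, not something stated in the episode: every batch pays one fixed cost to load the weights, each query adds its own compute cost, and a backward pass costs about twice a forward pass. The function names and all numbers are hypothetical and in arbitrary units.

```python
def cost_per_query(weight_load_cost: float, compute_cost: float, batch_size: int) -> float:
    """Amortized cost of one query when batch_size queries share one weight load.

    Assumed model: the weights are loaded from memory once per batch, and
    every query in the batch pays its own compute cost on top of that.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return weight_load_cost / batch_size + compute_cost


def training_step_compute(forward_compute: float, backward_ratio: float = 2.0) -> float:
    """Compute for one training step: a forward pass plus a backward pass.

    backward_ratio=2.0 encodes the summary's "about double the compute"
    for running the model backwards; treat it as a rough rule of thumb.
    """
    return forward_compute * (1.0 + backward_ratio)


if __name__ == "__main__":
    # Hypothetical costs: loading weights = 100 units, one query's compute = 1 unit.
    for batch_size in (1, 8, 64, 512):
        print(batch_size, cost_per_query(100.0, 1.0, batch_size))
```

Under these assumed numbers the per-query cost falls toward the bare compute cost as the batch grows, which is why a provider that can pool many users onto the same hardware serves each query more cheaply.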

Today, my guest is 21-year-old Gavin Uberti, who dropped out of Harvard to build Etched, which is one of the most fascinating companies I’ve seen. The topic of our conversation is the ongoing revolution in artificial intelligence, and more specifically the chips and technology that power these incredible models. To date, general-purpose AI chips like Nvidia GPUs have powered the revolution, but Gavin’s bet is that purpose-built chips, hard-coded for the underlying model architecture, will dramatically reduce the latency and cost of running models like GPT-4. We’re about to embark on what Gavin calls the “largest infrastructure buildout since the industrial revolution”, and I won’t spoil what he thinks this will unlock for all of us. It is so uplifting to me that someone so young can be working on something so big. Please enjoy this great conversation with Gavin Uberti.

Check out Etched.AI

Listen to Founders Podcast

For the full show notes, transcript, and links to mentioned content, check out the episode page here.

-----

This episode is brought to you by Tegus. Tegus is the modern research platform for leading investors, and provider of Canalyst. Tired of calculating fully diluted shares outstanding? Access every publicly-reported data point and industry-specific KPI through their database of over 4,000 drivable global models hand-built by a team of sector-focused analysts, 35+ industry comp sheets, and Excel add-ins that let you use their industry-leading data in your own spreadsheets. Tegus’ models automatically update each quarter, including hard-to-calculate KPIs like stock-based compensation and organic growth rates, empowering investors to bypass the friction of sourcing, building, and updating models. Make efficiency your competitive advantage and take back your time today. As a listener, you can trial Canalyst by Tegus for free by visiting tegus.co/patrick.

-----

Invest Like the Best is a property of Colossus, LLC. For more episodes of Invest Like the Best, visit joincolossus.com/episodes.

Past guests include Tobi Lutke, Kevin Systrom, Mike Krieger, John Collison, Kat Cole, Marc Andreessen, Matthew Ball, Bill Gurley, Anu Hariharan, Ben Thompson, and many more.

Stay up to date on all our podcasts by signing up to Colossus Weekly, our quick dive every Sunday highlighting the top business and investing concepts from our podcasts and the best of what we read that week. Sign up here.

Show Notes:

(00:03:41) - (first question) - Born too late to explore the world, too early to explore the stars

(00:05:59) - Interpreting and defining superintelligence

(00:07:20) - Excitement we can have for an AI-driven future

(00:09:25) - Overview and basic terminology of the transformers that power AI

(00:15:53) - What Q* is and the rumors around it

(00:20:41) - Robotics, machinery, and what’s interesting about them

(00:23:18) - The problem of latency and computing power

(00:28:55) - Needing to build physical infrastructure that doesn’t exist

(00:32:18) - Inference and training AI models

(00:36:00) - Major stages of chip design and the upper limits of speed

(00:45:56) - Customers for billion-dollar generative AI models

(00:48:56) - A sci-fi-esque reality and the politicization of AI

(00:50:38) - The Bitter Lesson and the implications of powerful AI models

(00:56:27) - The most important companies in the AI space today

(01:01:52) - Strategically building a defensible AI product

(01:04:07) - Software development and why other AI companies fail

(01:06:51) - Specialization and chip performance improvement

(01:15:34) - Why the transformer remains the leading architecture

(01:17:26) - A proliferation of models beyond the major players and data access

(01:21:19) - The kindest thing anyone has ever done for Gavin
