LessWrong (Curated & Popular)

“What o3 Becomes by 2028” by Vladimir_Nesov

Jan 9, 2025
Vladimir Nesov, an author known for his analyses of AI scaling, takes a deep look at the training systems behind OpenAI's o3 and what they might become. He discusses the scale of investment needed to keep scaling AI capabilities, with upcoming models projected to train at unprecedented FLOP counts, and weighs data quality against quantity, pointing to a need for around 50 trillion training tokens. Nesov also assesses GPT-4's standing relative to its competitors and considers what advances might arrive by 2028.
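The 50-trillion-token figure can be put in perspective with standard scaling arithmetic. The sketch below assumes the common C ≈ 6·N·D approximation for dense-transformer training FLOPs and a Chinchilla-style compute-optimal ratio of D ≈ 20·N; both ratios, and the resulting model size, are illustrative assumptions rather than numbers from the episode.

```python
# Back-of-envelope training-compute arithmetic (illustrative assumptions,
# not figures from the episode).

TOKENS = 50e12  # 50 trillion training tokens, the figure discussed in the episode

# Chinchilla-style compute-optimal sizing: roughly 20 tokens per parameter (assumed ratio).
params = TOKENS / 20            # ~2.5e12 parameters

# Standard dense-transformer approximation: C ~= 6 * N * D training FLOPs.
flops = 6 * params * TOKENS     # ~7.5e26 FLOPs

print(f"compute-optimal params: {params:.1e}")
print(f"training FLOPs:         {flops:.1e}")
```

Run as-is, this pins a compute-optimal 50T-token run at roughly 7.5e26 FLOPs, which gives a concrete sense of "unprecedented FLOP counts" next to GPT-4-era runs commonly estimated at around 2e25 FLOPs.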
AI Snips
INSIGHT

GPT-4's Reign

  • GPT-4's capabilities went largely unmatched well into 2024, likely because of the substantial compute and architectural expertise behind it.
  • Competitors caught up only by replicating comparable scale and expertise, underscoring that both factors matter in frontier AI development.
INSIGHT

Scaling Engines

  • In 2024, training systems costing billions of dollars and drawing substantial power came online, marking a step change in AI investment.
  • Hardware advantages like larger scale-up domains, and the smaller minibatch sizes they permit, might give some labs an edge in scaling; both points are sanity-checked in the sketch below.
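The cost, power, and minibatch points above can be made concrete with rough cluster arithmetic. Every numeric input in the sketch below (GPU count, per-unit price, power draw, PUE, per-replica batch, domain sizes) is an illustrative assumption, not a figure from the episode.

```python
# Rough frontier-cluster arithmetic (all numeric inputs are illustrative
# assumptions, not figures from the episode).

NUM_GPUS = 100_000        # assumed accelerator count for a 2024-scale training system
PRICE_PER_GPU = 40_000    # assumed all-in dollars per accelerator, networking included
WATTS_PER_GPU = 1_200     # assumed draw per accelerator, host overhead included
PUE = 1.3                 # assumed datacenter power usage effectiveness

capex_usd = NUM_GPUS * PRICE_PER_GPU                 # ~$4B: "costing billions"
power_mw = NUM_GPUS * WATTS_PER_GPU * PUE / 1e6      # ~156 MW: "significant power"
print(f"hardware capex ${capex_usd / 1e9:.1f}B, facility power {power_mw:.0f} MW")

# Larger scale-up (model-parallel) domains leave fewer data-parallel replicas,
# which lets the global minibatch per optimizer step stay smaller.
SEQS_PER_REPLICA = 16     # assumed sequences each replica contributes per step
for domain in (8, 72, 256):   # e.g. one server, an NVL72-like rack, a pod
    replicas = NUM_GPUS // domain
    print(f"scale-up domain {domain:>3}: {replicas:>6} replicas, "
          f"global minibatch {replicas * SEQS_PER_REPLICA} sequences")
```

The final loop shows the mechanism behind the scale-up-domain point: with a fixed GPU count, a larger model-parallel domain leaves fewer data-parallel replicas, so the global minibatch per optimizer step can stay smaller.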
INSIGHT

Future Scaling

  • Funding for today's massive training systems is feasible, but further scale-ups might stall without stronger justification.
  • Results from OpenAI's o3 and its successors could sway the decision to build even larger, more expensive systems.