AI won't plateau — if we give it time to think | Noam Brown
Feb 1, 2025
auto_awesome
Noam Brown, a leading research scientist at OpenAI, shares insights on transforming AI development by emphasizing slow, deliberate reasoning. He discusses the evolution of poker AI, detailing how thoughtful deliberation improved its performance against human players. Brown highlights the importance of patience and investment in AI to avoid stagnation, arguing that advancements are still on the horizon. He also illustrates how giving AI more thinking time can unlock extraordinary potential, challenging the idea that AI's progress is plateauing.
Traditional AI enhances its capabilities through exponential data scaling, but the new o1 model prioritizes deliberate reasoning akin to human thought.
Allowing AI systems extended thinking time leads to significantly improved decision-making, suggesting a pivotal shift in AI training methodologies.
Deep dives
The Evolution of AI Through Scale
The progression of artificial intelligence over the last five years has largely been driven by scale, involving both the amount of data used for training and the computational resources required. Although the underlying transformer architecture remains consistent, the size and complexity of the models have been growing exponentially. In 2019, training models like GPT-2 cost around $5,000, while today’s frontier models can require hundreds of millions of dollars to train. This raises questions about the sustainability of such scaling practices and whether AI development might soon plateau as it becomes increasingly expensive.
The Role of Thinking Time in AI Performance
Research into the performance of AI models has revealed that allowing these systems additional thinking time can significantly enhance their decision-making capabilities. A striking example is found in the development of a poker AI, which improved dramatically when allowed to think for just 20 seconds per hand, equating this time investment to a 100,000-fold increase in model training. This insight suggests that cultivating 'System 2' thinking—more deliberate and methodical processing—can yield improvements beyond merely increasing system complexity. Other AI systems, like chess and Go players, also demonstrate similar benefits from extended thinking time, highlighting a nuanced approach to AI training.
A New Paradigm in AI Development
The advent of more sophisticated models, such as the new language processing AI O1, marks a pivotal shift in AI development strategies. By prioritizing longer thinking times without sacrificing response quality, these models open up new avenues for scaling AI capabilities. Despite potential concerns regarding response times and costs, the trade-off may be justified when applied to critical issues like healthcare or climate change. This redefinition of what AI can achieve signals that a new era is underway, emphasizing the importance of both efficient data training and strategic thinking time.
To get smarter, traditional AI models rely on exponential increases in the scale of data and computing power. Noam Brown, a leading research scientist at OpenAI, presents a potentially transformative shift in this paradigm. He reveals his work on OpenAI's new o1 model, which focuses on slower, more deliberate reasoning — much like how humans think — in order to solve complex problems.