How GPT-5 Thinks — OpenAI VP of Research Jerry Tworek

Oct 16, 2025

Join Jerry Tworek, VP of Research at OpenAI, as he dives into the fascinating world of AI reasoning. Discover how GPT-5 evolves from earlier models, emphasizing the crucial roles of pretraining and reinforcement learning. Jerry explains the mechanics of chain-of-thought reasoning, the significance of agentic tools like Codex, and the importance of robust collaboration in research. He even shares insights from his personal journey from math and trading to cutting-edge AI research. Could pretraining combined with RL be key to achieving AGI? Tune in to find out!

INSIGHT

What AI Reasoning Actually Is

Reasoning is the process of getting to answers you don't already know by performing intermediate work and search-like steps.
Chain-of-thought verbalizes that process in words, letting models show systematic intermediate steps to reach solutions.

INSIGHT

How Chain-Of-Thought Helps

Chain-of-thought prompts coax models to generate stepwise tokens that perform partial computations and then combine them into an answer.
Models learn these human-like stepwise procedures from pretraining on internet text containing many examples of human reasoning.
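As a toy illustration of partial computations being combined into an answer, the sketch below multiplies two numbers by writing out one partial product per digit and then summing them. It is only an analogy for stepwise reasoning text, not a description of how a model computes internally; the function name `multiply_stepwise` is hypothetical and assumes non-negative integers.

```python
def multiply_stepwise(a: int, b: int) -> tuple[list[str], int]:
    """Multiply a by b (both non-negative), recording each intermediate step as text."""
    steps = []
    total = 0
    # Walk the digits of b from least to most significant.
    for power, digit_char in enumerate(reversed(str(b))):
        place_value = int(digit_char) * 10**power
        partial = a * place_value
        steps.append(f"{a} x {place_value} = {partial}")
        total += partial
    # The final step combines the partial results into the answer.
    steps.append(f"sum of partial results = {total}")
    return steps, total


if __name__ == "__main__":
    steps, answer = multiply_stepwise(47, 36)
    for line in steps:
        print(line)
```

Each printed line plays the role of one written-out intermediate step; the last line combines them.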

ADVICE

Optimize Thinking Time For UX

Balance compute (thinking time) against user wait time by letting users choose higher or lower 'thinking' settings.
Tune heuristics to predict when extra thinking is worth the user's waiting time and offer auto modes accordingly.
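A minimal sketch of what such an auto mode could look like, under stated assumptions: the setting names, time budgets, difficulty score, and threshold below are all hypothetical placeholders, not anything described in the episode or taken from an actual product.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ThinkingSetting:
    name: str
    max_thinking_seconds: float  # hypothetical time budget for this setting


# Hypothetical settings a user could pick explicitly.
SETTINGS = {
    "low": ThinkingSetting("low", 2.0),
    "high": ThinkingSetting("high", 60.0),
}


def choose_setting(
    user_choice: str,
    estimated_difficulty: float,   # assumed score in [0, 1] from some predictor
    patience_seconds: float,       # assumed estimate of how long the user will wait
    difficulty_threshold: float = 0.6,  # tunable heuristic, value is illustrative
) -> ThinkingSetting:
    """Honor an explicit choice; in 'auto' mode, think longer only when it seems worth the wait."""
    if user_choice in SETTINGS:
        return SETTINGS[user_choice]
    if user_choice != "auto":
        raise ValueError(f"unknown thinking setting: {user_choice!r}")
    high = SETTINGS["high"]
    hard_enough = estimated_difficulty >= difficulty_threshold
    user_can_wait = high.max_thinking_seconds <= patience_seconds
    return high if hard_enough and user_can_wait else SETTINGS["low"]
```

In this sketch an explicit user choice always wins, and the auto mode falls back to the faster setting whenever either the task looks easy or the expected wait exceeds the user's patience.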
