The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

How OpenAI Builds AI Agents That Think and Act with Josh Tobin - #730

433 snips

May 6, 2025

Josh Tobin, a member of the technical staff at OpenAI and co-founder of Gantry, dives into the fascinating world of AI agents. He discusses OpenAI's innovative offerings like Deep Research and Operator, highlighting their ability to manage complex tasks through advanced reasoning. The conversation also explores unexpected use cases for these agents and the future of human-AI collaboration in software development. Additionally, Josh emphasizes the challenges of ensuring trust and safety as AI systems evolve, making for an insightful and thought-provoking discussion.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

INSIGHT

Agent Training Unlocks Better Workflows

Training models end to end on multi-step tasks enables them to learn strategies better than manual human designs.
Models can learn to recover from errors and improve workflow performance through reinforcement learning.

INSIGHT

Foundation Models Shift Business Strategy

General purpose models like GPT-4 reduce the need for most businesses to train their own models.
Building on commercial foundation models is faster and cheaper than creating custom domain models.

INSIGHT

Training Agents to Self-Correct Errors

Traditional LLMs struggle with multi-step tasks due to compounding errors and lack of agentic training.
RL-trained agents can detect failures and self-correct, increasing reliability in complex workflows.

Get the Snipd Podcast app to discover more snips from this episode

Get the app