

Superhuman Performance of a LLM on the Reasoning Tasks of a Physician
Jan 23, 2025
Dr. Adam Rodman, a physician at Beth Israel Deaconess Medical Center and an expert in large language models, discusses the revolutionary impact of AI on clinical reasoning. The conversation covers the diagnostic performance of OpenAI's o1 model, which surpasses that of earlier models such as GPT-4, and how enhanced machine reasoning abilities could transform neurology and reduce human error. Rodman emphasizes the importance of physicians in shaping technology policy so that these tools are integrated responsibly into patient care while the human touch in medicine is preserved.
LLMs and Reasoning
- Large language models (LLMs) are fundamentally word prediction tools.
- They exhibit emergent reasoning properties, especially in diagnosis, likely due to semantic grouping similar to human cognition.
Chain-of-Thought Prompting
- Chain-of-thought prompting improves LLM reasoning by making them "show their work".
- Unusual prompts, such as ones that simulate an existential threat to the model, can also enhance performance.
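As a rough illustration of what "show their work" means in practice, the sketch below builds the same diagnostic question with and without a chain-of-thought instruction. The function name `build_prompt`, the wording of the instruction, and the clinical vignette are all hypothetical, made up for this example rather than taken from the episode or from any particular model's API.

```python
def build_prompt(case: str, chain_of_thought: bool = False) -> str:
    """Return a diagnostic prompt, optionally asking for step-by-step reasoning."""
    prompt = f"Clinical case: {case}\nQuestion: What is the most likely diagnosis?"
    if chain_of_thought:
        # Chain-of-thought: ask the model to write out its reasoning
        # before committing to a final answer.
        prompt += (
            "\nThink step by step: list the key findings, "
            "weigh the differential diagnoses, then state your final answer."
        )
    return prompt


# Hypothetical vignette, used only to show the two prompt variants.
case = "A 68-year-old presents with sudden right-sided weakness and slurred speech."
direct = build_prompt(case)
cot = build_prompt(case, chain_of_thought=True)
print(cot)
```

The only difference between the two prompts is the added instruction. The text would then be sent to whichever model is being evaluated, a step this sketch deliberately leaves out.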
o1 Model's Advantage
- OpenAI's o1 model is fine-tuned on the chain-of-thought process itself rather than only on its final outputs.
- This computationally intensive approach results in superior performance on cognitive tasks.