The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Is ChatGPT Getting Worse? with James Zou - #645

11 snips
Sep 4, 2023
In this conversation, James Zou, an assistant professor at Stanford known for his work in biomedical data science, dives into the evolving landscape of ChatGPT. He examines its fluctuating performance over recent months, discussing intriguing comparisons between versions. The potential for surgical AI enhancements inspires thoughts on the future of large language models. Zou also shares innovative insights on using Twitter data to build medical imaging datasets, addressing the challenges of data quality and oversight in AI for healthcare applications.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

ChatGPT's Evolving Behavior

  • ChatGPT's behavior changes over time, impacting various applications.
  • This necessitates systematic assessment across diverse tasks to understand these shifts.
ANECDOTE

Evaluating Complex Output

  • Evaluating ChatGPT's complex output requires diverse methods.
  • Prime number identification illustrates how accuracy can be assessed for simpler tasks.
INSIGHT

Surprising Performance Drop

  • Surprisingly, later ChatGPT versions performed worse on some tasks, including prime number identification.
  • Chain-of-thought reasoning, effective in March, declined in June, impacting accuracy.
Get the Snipd Podcast app to discover more snips from this episode
Get the app