The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Is ChatGPT Getting Worse? with James Zou - #645

Sep 4, 2023
In this conversation, James Zou, an assistant professor at Stanford known for his work in biomedical data science, dives into the evolving landscape of ChatGPT. He examines its fluctuating performance over recent months, discussing intriguing comparisons between versions. The potential for surgical AI enhancements inspires thoughts on the future of large language models. Zou also shares innovative insights on using Twitter data to build medical imaging datasets, addressing the challenges of data quality and oversight in AI for healthcare applications.
42:17

Podcast summary created with Snipd AI

Quick takeaways

  • ChatGPT's performance changed significantly between versions, improving on some tasks and degrading on others, which highlights the need for more precise edits and control in large language models.
  • Leveraging data from social media platforms like Twitter can be valuable for training AI systems in specific domains, such as pathology image analysis.

Deep dives

ChatGPT and Changes in Behavior

The episode discusses research on how the behavior of ChatGPT, a popular language model, has changed over time. The researchers conducted a systematic assessment, comparing the March and June versions of ChatGPT across a range of tasks. Surprisingly, they found that while the later version performed better on some tasks, it performed substantially worse on others, including seemingly simpler ones such as identifying prime numbers. The researchers explored possible reasons for this shift, such as conflicting training objectives and "pleiotropy", where a change aimed at one behavior has unintended effects on others. They highlighted the need for more precise, surgical edits to large language models to improve control and transparency.
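To make the idea of a version-to-version comparison concrete, here is a minimal Python sketch of scoring two model versions on the prime-number task mentioned above. It is an illustration only, not the researchers' actual harness: the names `accuracy`, `compare_versions`, `always_yes`, and `always_no` are hypothetical, and the two stub functions stand in for real model calls, which the episode summary does not describe.

```python
from typing import Callable, Dict, Iterable


def is_prime(n: int) -> bool:
    """Ground-truth primality check using trial division."""
    if n < 2:
        return False
    if n % 2 == 0:
        return n == 2
    d = 3
    while d * d <= n:
        if n % d == 0:
            return False
        d += 2
    return True


def accuracy(answer_fn: Callable[[int], bool], numbers: Iterable[int]) -> float:
    """Fraction of numbers for which answer_fn matches the ground truth."""
    numbers = list(numbers)
    correct = sum(answer_fn(n) == is_prime(n) for n in numbers)
    return correct / len(numbers)


def compare_versions(
    versions: Dict[str, Callable[[int], bool]], numbers: Iterable[int]
) -> Dict[str, float]:
    """Score every version on the same fixed set of numbers."""
    numbers = list(numbers)
    return {name: accuracy(fn, numbers) for name, fn in versions.items()}


# Hypothetical stand-ins for two model versions (assumption: a real study
# would send each number to the model and parse its yes/no answer).
def always_yes(n: int) -> bool:
    return True


def always_no(n: int) -> bool:
    return False


if __name__ == "__main__":
    test_numbers = [2, 3, 4, 9, 11, 15, 17, 21]
    scores = compare_versions(
        {"version_a": always_yes, "version_b": always_no}, test_numbers
    )
    print(scores)
```

Running both versions on the same fixed inputs is what makes the scores comparable. It also helps to balance primes and composites in the test set, since a model that always gives the same answer can otherwise look deceptively accurate.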
