
GPT-5s Are Alive: Basic Facts, Benchmarks and the Model Card
Don't Worry About the Vase Podcast
00:00
Evaluating GPT-5: Safety and Red Team Insights
This chapter explores the extensive red teaming efforts conducted to assess the safety of GPT-5, including analyses of adversarial scenarios and performance metrics. It highlights improvements in model resilience against attacks, while also addressing the need for nuanced evaluation protocols in light of potential risks. The findings suggest that GPT-5 poses minimal catastrophic risks, but emphasize ongoing challenges in understanding its reasoning and behavioral subtleties.
Transcript
Play full episode