Everyday AI Podcast – An AI and ChatGPT Podcast

Ep 662: Opus 4.5: New king of the AI hill or just a niche model for coders?

68 snips
Nov 26, 2025
The latest AI showdown has arrived with the debut of Claude Opus 4.5, claimed to be the best model for coding and agentic tasks. Is this the new go-to for developers or just a niche player? A deep dive into its benchmarks reveals a mixed performance compared to Gemini 3 Pro. Exciting features like document consistency and a Chrome extension make waves, while live demos reveal both strengths and limitations. Join the discussion about whether Opus 4.5 will reign supreme or serve a specialized audience!
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Opus 4.5 Is A Vertical Power Play

  • Anthropic's Opus 4.5 targets coding, agents, and computer use as its core strengths.
  • Jordan highlights it as a focused vertical play rather than a general-purpose model.
INSIGHT

Benchmarks Are Nuanced Not Absolute

  • Benchmarks show Opus 4.5 leads in agentic and software-engineering tasks but not uniformly across all third-party aggregates.
  • Jordan notes Gemini 3 Pro and GPT-5 variants still outperform Opus on several aggregated coding indexes.
INSIGHT

Anthropic's Strategy Is Vertical Specialization

  • Anthropic appears to focus its model development on vertical specialties like engineering and finance rather than broad creative tasks.
  • Jordan sees this as a deliberate strategy away from general-purpose creativity toward domain strength.
Get the Snipd Podcast app to discover more snips from this episode
Get the app