Last Week in AI cover image

Last Week in AI

#195 - OpenAI o3 & for-profit, DeepSeek-V3, Latent Space

Jan 5, 2025
OpenAI unveils exciting advancements in its O3 model, significantly boosting reasoning capabilities. Meanwhile, tensions simmer between Microsoft and OpenAI over their partnership as the latter shifts to a for-profit model. Chinese firms like DeepSeek are making waves with their impressive open-source AI models, showcasing innovation in performance. Sakana AI adds curiosity-driven exploration to the mix by applying AI in the search for artificial life, hinting at the limitless possibilities ahead in the realm of artificial intelligence.
01:39:05

Podcast summary created with Snipd AI

Quick takeaways

  • OpenAI's O3 model showcases significant advancements in reasoning, achieving 72% accuracy on the SWE Bench verified benchmark, highlighting its problem-solving improvements.
  • The transition of OpenAI to a for-profit model raises ethical concerns regarding public accountability, potentially prioritizing investor returns over societal interests in AI safety.

Deep dives

Introduction of OpenAI's O3 Model

The O3 model from OpenAI demonstrates significant advancements in reasoning capabilities. With a remarkable score of 72% accuracy on the SWE Bench verified benchmark, O3 exhibits a notable improvement over its predecessor, O1, which only managed 49%. Additionally, O3 performed impressively on other evaluations, such as achieving 97% on the AMI benchmark, highlighting its enhanced problem-solving skills. As it undergoes public safety testing, users are invited to apply for access to further explore its capabilities.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode