LessWrong (30+ Karma) cover image

LessWrong (30+ Karma)

“AI #113: The o3 Era Begins” by Zvi

Apr 25, 2025
02:00:41
Enjoy it while it lasts. The Claude 4 era, or the o4 era, or both, are coming soon. Also, welcome to 2025, we measure eras in weeks or at most months. For now, the central thing going on continues to be everyone adapting to the world of o3, a model that is excellent at providing mundane utility with the caveat that it is a lying liar. You need to stay on your toes. This was also quietly a week full of other happenings, including a lot of discussions around alignment and different perspectives on what we need to do to achieve good outcomes, many of which strike me as dangerously mistaken and often naive. I worry that growingly common themes are people pivoting to some mix of ‘alignment is solved, we know how to get an AI to do what we want it to do, the question is alignment to [...]

---

Outline:

(01:33) Language Models Offer Mundane Utility

(05:27) You Offer the Models Mundane Utility

(07:25) Your Daily Briefing

(08:20) Language Models Don't Offer Mundane Utility

(12:27) If You Want It Done Right

(14:27) No Free Lunch

(16:07) What Is Good In Life?

(21:54) In Memory Of

(25:45) The Least Sincere Form of Flattery

(27:18) The Vibes are Off

(30:47) Here Let Me AI That For You

(32:25) Flash Sale

(34:38) Huh, Upgrades

(36:03) On Your Marks

(44:03) Be The Best Like No LLM Ever Was

(48:40) Choose Your Fighter

(51:00) Deepfaketown and Botpocalypse Soon

(54:57) Fun With Media Generation

(56:11) Fun With Media Selection

(57:39) Copyright Confrontation

(59:38) They Took Our Jobs

(01:05:31) Get Involved

(01:05:41) Ace is the Place

(01:09:43) In Other AI News

(01:11:31) Show Me the Money

(01:12:49) The Mask Comes Off

(01:16:54) Quiet Speculations

(01:20:34) Is This AGI?

(01:22:39) The Quest for Sane Regulations

(01:23:03) Cooperation is Highly Useful

(01:25:47) Nvidia Chooses Bold Strategy

(01:27:15) How America Loses

(01:28:07) Security Is Capability

(01:31:38) The Week in Audio

(01:33:15) AI 2027

(01:34:38) Rhetorical Innovation

(01:38:55) Aligning a Smarter Than Human Intelligence is Difficult

(01:46:30) Misalignment in the Wild

(01:51:13) Concentration of Power and Lack of Transparency

(01:57:12) Property Rights are Not a Long Term Plan

(01:58:48) It Is Risen

(01:59:46) The Lighter Side

The original text contained 1 footnote which was omitted from this narration.

---

First published:
April 24th, 2025

Source:
https://www.lesswrong.com/posts/7x9MZCmoFA2FtBtmG/ai-113-the-o3-era-begins

---

Narrated by TYPE III AUDIO.

---

Images from the article:

Side-by-side comparison showing original movie scene and AI recreation.
ChatGPT interface showing conversation about daily science news summary scheduling.
Bar graph showing
Diagram showing relationships between Frontier Safety Policies, Evaluations, Mitigations, and oversight frameworks.
Organizational diagram showing Internal Deployment Team and Board's policy management structure.
A video thumbnail showing
Three text messages showing
Timeline graph showing
Three-column comparison chart showing Strong Support, Reframing, and Strong Resistance values
Leaderboard table comparing AI models' performance metrics including net worth and sales data.
Bar graph showing
Text excerpt discussing AGI capabilities and timeline, mentioning chess, storytelling, and baking abilities.
Graph showing action prediction latency comparison across nine different AI models/systems, ranging from 324ms to 12642ms.
Graph showing
Graph showing AI language model performance versus price per million tokens, with pareto frontier line. Points represent different models from major tech companies, including Google, OpenAI, Meta, and others.
Diagram showing AI values analysis, feature extraction, and human conversations framework.

This flow chart illustrates how AI systems respond to different types of user requests, mapping out values taxonomies (epistemic and personal), feature extraction processes, and AI response patterns. The visualization includes real conversation examples and their corresponding value classifications.
Performance comparison table showing benchmark scores for different AI language models.

The table compares various metrics including pricing, reasoning ability, science knowledge, mathematics, code generation, and other capabilities across models like Gemini, OpenAI, Claude, Grok 3, and DeepSeek R1.
Bar graphs showing
Hierarchical diagram showing
Table comparing governance safeguards, showing current status versus proposed restructuring.

The table shows six key governance safeguards related to charitable purpose, fiduciary duties, profit controls, board composition, AGI ownership, and stop-and-assist commitments. Each item compares
Magazine cover
Historical graph showing interest rates and loan trends from 1310-2018 across European powers.
A street advertisement for

Apple Podcasts and Spotify do not show images in the episode description. Try Pocket Casts, or another podcast app.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner