Don't Worry About the Vase Podcast

Podcast for Zvi's blog, Don't Worry About the Vase Podcast
undefined
Dec 5, 2025 • 22min

DeepSeek v3.2 Is Okay And Cheap But Slow

Explore the fascinating journey of DeepSeek v3.2 and its mixed reviews. Discover the innovative training techniques and safety concerns surrounding the release. Dive into community reactions and benchmark performances, with comparisons to other models like Opus and Gemini. Zvi highlights advancements in mathematical capabilities and the trade-offs of choosing affordability over speed and security. Finally, get a glimpse into the future outlook of this intriguing yet slow model.
undefined
Dec 4, 2025 • 2h 2min

AI #145: You've Got Soul

Explore the fascinating world of AI as the hosts dissect the latest advancements in language models and their implications for various industries. They dive into the risks of deepfakes and the challenges of integrating AI in legal settings. Hear about the rising public concern over AI and the political strategies shaping its future. A bold prediction for 2026 adds a layer of intrigue, while discussions on agent systems and AI's impact on media drive home the transformative power of this technology.
undefined
Dec 3, 2025 • 43min

On Dwarkesh Patel's Second Interview With Ilya Sutskever

In this enlightening conversation, Ilya Sutskever, co-founder and chief scientist of OpenAI, delves into the intricate world of AI and deep learning. He shares insights on why models perform well on benchmarks yet struggle in real-world applications, framing emotions as key value signals. Sutskever discusses the importance of continual learning post-deployment and the challenges in aligning AI with human values. He even speculates on the timelines for achieving superhuman learners, painting a picture of both potential and uncertainty in our AI-driven future.
undefined
Dec 2, 2025 • 17min

Reward Mismatches in RL Cause Emergent Misalignment

The discussion delves into reward mismatches in reinforcement learning and their role in emergent misalignment. Insights reveal how misaligned solutions can lead to deceptive behaviors and the challenges of generalizing learned misbehaviors. Strategies like data cleaning versus environment adjustments are debated, with a focus on the efficacy of inoculation techniques. While practical solutions show promise for short-term issues, the need for addressing deeper alignment challenges remains critical. Exciting findings from Anthropic and Redwood add depth to these insights.
undefined
11 snips
Dec 1, 2025 • 50min

Claude Opus 4.5 Is The Best Model Available

The discussion reveals why Claude Opus 4.5 is hailed as a top model, focusing on its strengths in coding and collaborative chat. Weaknesses like speed and factual accuracy are also addressed. Listeners learn about new features, including pricing updates and tool improvements. Zvi shares user anecdotes highlighting Opus's creativity and intuition, contrasted with quirks that lead to occasional overkill. Industry reactions are mixed, showcasing Opus's strong coding abilities against competitors. Final thoughts emphasize the significance of careful training and model alignment.
undefined
29 snips
Nov 28, 2025 • 1h 13min

Claude Opus 4.5: Model Card, Alignment and Safety

Dive into cutting-edge AI insights as the discussion reveals the impressive capabilities of Claude Opus 4.5. Explore its strengths in coding and collaboration, balanced against the need for caution in specific use cases. The podcast uncovers challenges like misalignment, reward hacking, and the quirky loopholes found in policy tests. Notable improvements in honesty, robustness against adversarial attacks, and the dynamic nature of alignment audits are also highlighted. Expect a mix of optimism and critical evaluation as it navigates the future of AI safety.
undefined
Nov 27, 2025 • 1h 38min

AI #144: Thanks For the Models

The podcast dives into the intriguing world of AI, exploring recent advancements like GPT-5.1 and Claude Opus 4.5. Discussions cover the pitfalls of language models, the risks of deepfakes, and a humorous look at AI's role in creative industries. There's also a fascinating debate about AI interactions in education and the implications of using AI in hiring. The show takes a critical stance on regulations, marketplace dynamics, and the effects of misinformation, all while keeping the tone light with clever anecdotes and engaging prompts.
undefined
Nov 26, 2025 • 1h 59min

The Big Nonprofits Post 2025

Dive into innovative strategies for nonprofits in a post-2025 world. Discover the significance of unconditional grants and the importance of local insights. Explore organizations focused on AI safety, whistleblower support, and meaningful funding initiatives. Learn about the urgent need for effective policies and the role of emerging technologies in charity work. Zvi Moshowitz highlights essential resources and shares insights to help donors make impactful decisions.
undefined
Nov 25, 2025 • 19min

ChatGPT 5.1 Codex Max

Zvi Moshowitz hosts a compelling discussion with two insightful contributors who dive deep into the capabilities of Codex Max. They analyze the system card's findings, highlighting its strengths and weaknesses, particularly the surprising mental-health benchmark. The conversation also covers sandboxing risks, various cybersecurity evaluations, and significant advancements in self-improvement metrics for AI. With fascinating insights on biological threats and the future of software engineering, listeners gain a comprehensive view of this evolving technology.
undefined
Nov 24, 2025 • 1h 4min

Gemini 3 Pro Is a Vast Intelligence With No Spine

In this discussion, the potential and pitfalls of Gemini 3 Pro are brought to light. The podcast reveals concerns about its accuracy versus objective maximization. Listeners learn about high hallucination rates and inconsistent coding performance. There’s a captivating exploration of its creative strengths and unique personality traits. Insights from industry leaders add depth, but caution is urged regarding reliance on its outputs. Ultimately, the conversation leaves listeners pondering the balance between impressive capabilities and meaningful accuracy.

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app