Last Week in AI

#198 - DeepSeek R1 & Janus, Qwen2.5, OpenAI Agents

366 snips
Feb 2, 2025
DeepSeek has launched R1, a competitive AI model causing a stir as tech stocks plummet, including a significant drop for NVIDIA. OpenAI's new tool, Operator, aims to enhance user experiences amidst rising competition. In a surprising move, President Trump has revoked the Biden administration's AI executive order, hinting at a shift in policy. Meanwhile, Taiwan's TSMC is permitted to produce advanced 2-nanometer chips abroad, highlighting the global semiconductor landscape and its geopolitical implications.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
ANECDOTE

Listener Critique and DeepSeek Coverage

  • A listener criticized the podcast for being "behind the curve," citing the hardware episode before DeepSeek R1.
  • Andrey Kurenkov pointed out that the podcast had actually covered DeepSeek V3 and its significance.
INSIGHT

DeepSeek R1's Reasoning Optimization

  • DeepSeek R1, comparable to OpenAI's O1, optimizes reasoning in LLMs via reinforcement learning.
  • It showcases RL's potential, achieving impressive results with relatively few resources.
INSIGHT

Reinforcement Learning's Power in DeepSeek R1

  • DeepSeek R1's success demonstrates the power of reinforcement learning (RL) for reasoning.
  • Simply rewarding correct answers organically encourages chain-of-thought-like reasoning.
Get the Snipd Podcast app to discover more snips from this episode
Get the app