ThursdAI - The top AI news from the past week

📆 ThursdAI - Dec 26 - OpenAI o3 & o3 mini, DeepSeek v3 658B beating Claude, Qwen Visual Reasoning, Hume OCTAVE & more AI news

26 snips
Dec 27, 2024
Get ready for a whirlwind of AI breakthroughs! OpenAI's latest models sparked debates over AGI, while DeepSeek stunned everyone with a powerful open-source LLM. Discover how the new multimodal model from Qwen excels in visual reasoning tasks. The conversation dives deep into innovative voice generation and the security challenges that come with it. Join insights on the year's highlights in AI, along with predictions for 2024, as the community reflects on rapid advancements and the excitement brewing in the AI landscape.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

DeepSeek v3 Matches Claude

  • DeepSeek v3, a 658B parameter MoE model, beats Claude 3.5 on coding benchmarks.
  • This achievement marks a significant milestone for open-source AI.
ADVICE

Downloading DeepSeek v3

  • Don't download DeepSeek v3 yet, as necessary integrations are still pending.
  • Wait for official updates to Transformers Library and Llama CPP.
INSIGHT

DeepSeek Training Insight

  • DeepSeek used distillation from their reasoning model (R1) and synthetic data.
  • This approach contributed to the model's high performance.
Get the Snipd Podcast app to discover more snips from this episode
Get the app