ThursdAI - The top AI news from the past week

📆 ThursdAI - Nov 21 - The fight for the LLM throne, OSS SOTA from AllenAI, Flux new tools, Deepseek R1 reasoning & more AI news

30 snips
Nov 22, 2024
Junyang Lin, Dev Lead at Alibaba's Qwen team, shares insights on the game-changing Qwen Coder 2.5 and its 1M context capabilities. Nathan Lambert, a research scientist at AI2, dives into the newly released SOTA post-trained models and emphasizes the importance of open-source contributions. Eric Simons, CEO of StackBlitz, discusses the groundbreaking capabilities of bolt.new, a tool that simplifies web development using AI. Together, they explore the competitive dynamics in the LLM landscape and the potential of collaboration in advancing AI technology.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

PixTral Large Performance

  • Mistral AI's PixTral Large claims state-of-the-art performance on visual benchmarks.
  • It leverages a strong language backbone and understands various visual inputs.
ANECDOTE

Model Comparison Anecdote

  • Alex Volkov tested Quen VL and PixTral Large with a bar chart and evaluation table.
  • Each model surprisingly claimed the other performed better, highlighting the complexity of visual understanding.
INSIGHT

Stage Attention Potential

  • Stage Attention is a new technique that could potentially be the next Flash Attention.
  • It claims a 3x speedup, potentially making models three times better.
Get the Snipd Podcast app to discover more snips from this episode
Get the app