ThursdAI - The top AI news from the past week

GPT‑5.1’s New Brain, Grok’s 2M Context, Omnilingual ASR, and a Terminal UI That Sparks Joy

105 snips
Nov 13, 2025
This week dives into major AI breakthroughs! Discover the new GPT 5.1 update and explore Baidu’s impressive visual reasoning with Ernie 4.5 VL. The popular Terminal-Bench 2.0 makes a splash with its challenging coding tasks. Join a fascinating demo of 11 Labs' Scribe V2 for real-time transcription across 90+ languages. With insights into cutting-edge AI tools like W&B's LEET and deep research modes, it's a packed discussion on the latest in artificial intelligence innovations!
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Hard Terminal Benchmarks Reveal Real Headroom

  • Terminal‑Bench 2.0 presents 89 hard, human‑verified terminal tasks to evaluate agentic coding in realistic shells.
  • Top agents score ~50%, showing meaningful headroom for model improvement.
INSIGHT

Small Visual Models Punch Above Weight

  • Baidu's Ernie 4.5 VL (3B params) claims strong visual reasoning and competes with larger closed‑source models on VQA tasks.
  • The model emphasizes spatial grounding, zooming, and interactive image reasoning features.
ANECDOTE

Live Demo: Real‑Time ASR With Language Auto‑Switch

  • Paul from 11 Labs demoed Scribe V2 real‑time, a low‑latency ASR with ~150ms average latency and 90+ language support.
  • He showed live language auto‑detection and switching, transcribing English and Dutch seamlessly in the demo.
Get the Snipd Podcast app to discover more snips from this episode
Get the app