ThursdAI - The top AI news from the past week cover image

GPT‑5.1’s New Brain, Grok’s 2M Context, Omnilingual ASR, and a Terminal UI That Sparks Joy

ThursdAI - The top AI news from the past week

00:00

Terminal‑Bench 2.0 and reproducible agent evals

Alex introduces Terminal‑Bench 2.0, its 89 hard terminal tasks, Harbor harness, and why reproducible agent benchmarks matter.

Play episode from 01:10
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app