Investing on the Front Lines of the AI Arms Race | Nathan Benaich

Hidden Forces

When Models Perform Better on Verifiable vs. Open-Ended Tasks

Demetri asks why reasoning models excel on math but can struggle in open-ended conversation; Nathan contrasts verifiable benchmarks with subjective domains.

Play episode from 36:44