
Exploring AI Browsers
AppStories
Testing Agents on Practical Tasks
John suggests testing agents on practical tasks, such as printing crosswords, to evaluate how useful they are; they note niche successes in examples shared on Reddit.
Play episode from 23:40