Last Week in AI cover image

#156 - OpenAI's Sora, Gemini 1.5, BioMistral, V-JEPA, AI Task Force, Fun!

Last Week in AI

00:00

Advancements in Generalist Computer Agents and Benchmarking Challenges

This chapter explores the new OS Copilot framework for developing self-improving computer agents, specifically highlighting the agent named Friday. It also discusses the Gaia benchmark, showcasing Friday's impressive performance in overcoming challenging tasks.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app