
#156 - OpenAI's Sora, Gemini 1.5, BioMistral, V-JEPA, AI Task Force, Fun!
Last Week in AI
00:00
Advancements in Generalist Computer Agents and Benchmarking Challenges
This chapter explores the new OS Copilot framework for developing self-improving computer agents, specifically highlighting the agent named Friday. It also discusses the Gaia benchmark, showcasing Friday's impressive performance in overcoming challenging tasks.
Transcript
Play full episode