
#156 - OpenAI's Sora, Gemini 1.5, BioMistral, V-JEPA, AI Task Force, Fun!
Last Week in AI
Advancements in Generalist Computer Agents and Benchmarking Challenges
This chapter explores the new OS Copilot framework for developing self-improving computer agents, specifically highlighting the agent named Friday. It also discusses the Gaia benchmark, showcasing Friday's impressive performance in overcoming challenging tasks.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.