Last Week in AI cover image

#156 - OpenAI's Sora, Gemini 1.5, BioMistral, V-JEPA, AI Task Force, Fun!

Last Week in AI

CHAPTER

Advancements in Generalist Computer Agents and Benchmarking Challenges

This chapter explores the new OS Copilot framework for developing self-improving computer agents, specifically highlighting the agent named Friday. It also discusses the Gaia benchmark, showcasing Friday's impressive performance in overcoming challenging tasks.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner