ThursdAI - The top AI news from the past week cover image

📅 ThursdAI - June 20th - 👑 Claude Sonnet 3.5 new LLM king, DeepSeek new OSS code king, Runway Gen-3 SORA competitor, Ilya's back & more AI news from this crazy week

ThursdAI - The top AI news from the past week

00:00

Big Code Bench: Enhancing Model Performance and Collaboration

The chapter delves into the development of the Big Code Bench, focusing on its features like utilizing real-world function calls and APIs, handling complex instructions for better model performance, and supporting multiple programming languages. It also explores the concept of being quick to evolve through running an Evolve suite and mentions collaborations with Evil Plus and ServiceNow research teams to enhance test harnesses and foster discussions on coding capabilities of language models.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app