ThursdAI - The top AI news from the past week cover image

ThursdAI - May 2nd - New GPT2? Copilot Workspace, Evals and Vibes from Reka, LLama3 1M context (+ Nous finetune) & more AI news

ThursdAI - The top AI news from the past week

00:00

Advancements in AI Tools and Models

This chapter provides updates on various AI tools and software, discussing the release of new projects like itown and Jamba instruct, as well as the introduction of models such as Quinn 1.5 and GPT2-chat. It explores the significance of Large Language Models (LMCs) like Phi3 and the challenges of accurate evaluations, including the impact of diverse benchmarks and data contamination issues. The episode emphasizes the importance of evaluating models carefully, particularly in areas like Vibes eval by Rekka AI and correlations between model performance and external annotators.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app