ThursdAI - The top AI news from the past week cover image

ThursdAI - The top AI news from the past week

📆 ThursdAI - Feb 20 - Live from AI Eng in NY - Grok 3, Unified Reasoners, Anthropic's Bombshell, and Robot Handoffs!

Feb 20, 2025
Leonard Tang, Co-founder at Haize Labs, joins the conversation to discuss their innovative open source evaluation library, Verdict, aimed at improving AI judgment reliability. They dive into the fascinating capabilities of Grok 3, comparing its performance with competitors and addressing censorship challenges. Tang also highlights the impact of nepotism bias in AI models and how Verdict seeks to enhance evaluation efficiency. The podcast explores the exploratory advances in robotics, including robots learning to hand objects to one another, showcasing the exciting future of AI.
01:41:13

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • The release of R1-1776 by Perplexity marks a significant move towards censorship-free dialogues by targeting sensitive topics in Chinese governance.
  • EVO2, developed by Arc Institute and NVIDIA, represents a groundbreaking advancement in genomics with its 40 billion parameters and extensive dataset.

Deep dives

Open Source LLMs and Controversial Fine-Tuning

A recent development in open source LLMs involves the fine-tuning of DeepSeq R1 to create what is called R1-1776, specifically aiming to remove Chinese propaganda from the model. This fine-tuning is controversial as it targets approximately 300 topics deemed sensitive by the Chinese government, allowing for more open discussions on issues like Tiananmen Square. The methodology involved selecting examples that fall within the Chinese censorship umbrella, showcasing a clear stance on freedom of speech. Remarkably, this adjustment was made without a decline in performance on evaluation metrics.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode