#227 - Jeremie is back! DeepSeek 3.2, TPUs, Nested Learning

Last Week in AI

RL Compute, Off-Policy Masking, and Routing

Jeremie and Andrey discuss DeepSeek's heavy RL compute budget, off-policy sequence masking, and keep-routing for MoE stability.
