Lex Fridman Podcast cover image

#490 – State of AI in 2026: LLMs, Coding, Scaling Laws, China, Agents, GPUs, AGI

Lex Fridman Podcast

00:00

Post‑Training Advances: RL with Verifiable Rewards (RLVR)

Nathan explains RLVR: generate, grade, and optimize on verifiable tasks (math, code) and debates contamination and evaluation.

Play episode from 01:55:44
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app