[NeurIPS Best Paper] 1000 Layer Networks for Self-Supervised RL — Kevin Wang et al, Princeton

Latent Space: The AI Engineer Podcast

Clarifying the method: not reward regression

Benjamin clarifies that their method focuses on self-supervised objectives and representation learning, not direct reward maximization.

Play episode from 07:49