'Simulators' by Janus

LessWrong (Curated & Popular)

How to Guess the Laws of Physics?

The reason we don't see a bunch of simulated alternate universes after humans guess the laws of physics is that our reality has a huge state vector, which makes evolving it according to those laws infeasible to compute. This applies even if you've guessed the wrong laws; your simulation will just systematically diverge from reality. Models trained with the strict simulation objective are directly incentivized to reverse-engineer the semantic physics of the training distribution. I propose this as a description of the archetype targeted by self-supervised predictive learning. It stands in contrast to RL's archetype of an agent optimized to maximize free parameters, such as action trajectories, relative to a reward function.

Play episode from 01:19:35