LessWrong (Curated & Popular) cover image

'Simulators' by Janus

LessWrong (Curated & Popular)

00:00

Oracle GPT and Supervised Learning

GPT is not optimised to give true answers, but rather to minimise the divergence between predictions and training examples. It isn't specifically trained to give answers in the first place. GPT might resemble a generic Oracle AI because it is trained to make accurate predictions,. But its log-loss objective is myopic and only concerned with immediate, micro-scale correct prediction of the next token. The effect is like viewing GPT as an oracle AI in quotes.

Play episode from 40:30
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app