"SolidGoldMagikarp (plus, prompt generation)"

LessWrong (Curated & Popular)

Inter-Referential Hallucinations: The Model Repeats a Different Anomalous Token

These responses are examples of inter-referential hallucinations, in which the model, asked to repeat one anomalous token, repeats a different anomalous token instead. The authors add: "This was our first encounter with non-determinism at temperature zero: regenerating often produces 'I don't know what you're talking about'-style evasion."

Play episode from 18:12