The Bayesian Conspiracy cover image

Bayes Blast 4 – The Waluigi Effect

The Bayesian Conspiracy

CHAPTER

The LLM Is a Simulator of the Space of Text

The LLM is still sort of a simulator of the space of text. You could train it on basically like instances of walluigi behavior and then be like, stop it. Maybe we've got to start persecuting humans that try to jailbreak things by collapsing them into walluigi's. Plasibo Mansurs. Yes, exactly. Just like Eliezer said, if the terminators ever show up, he'll just be like, Oh, thank God, you were the robots who were sent here to protect me. That is the entirety of the post. I come out being slightly less doomeried than the post because I understand its arguments about walluigi’

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner