
Perhaps It Is A Bad Thing That The World's Leading AI Companies Cannot Control Their AIs
Astral Codex Ten Podcast
How to Make Methamphetamine
AI motivational systems are sticking to their own alien nature, regardless of what the AI's intellectual components know about what they "should" believe. Sometimes when RLHF does work, it's bad. I've yet to figure out whether this is related to the thing where I also sometimes do things which I can explain are bad, for example, eat delicious bagels instead of healthy vegetables. What happens when these three goals come into conflict? Here's a screen capture from Hacker News. ChatGPT produces made-up, non-existent references.