LessWrong (Curated & Popular) cover image

“A Three-Layer Model of LLM Psychology” by Jan_Kulveit

LessWrong (Curated & Popular)

CHAPTER

Understanding the Layers of LLM Psychology

This chapter explores a three-layer model of character-trained language models, detailing the distinct functions of the surface, character, and predictive ground layers. It highlights how these layers interact to influence the authenticity and nuance of responses based on context and user engagement.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner