
Discussion: Challenges with Unsupervised LLM Knowledge Discovery
LessWrong (Curated & Popular)
Challenges with Unsupervised LLM Knowledge Discovery
Exploring the reasons for inconsistent outcomes, including bugs and generalization failures, in unsupervised LLM knowledge discovery. Evaluating the original hypothesis with lower confidence, discussing difficulties in distinguishing human simulators from direct reporters, and limitations of consistency-based knowledge detection methods. Suggesting criteria for evaluating ELK methods and highlighting the need for suitable test beds for evaluation.


