Discussion: Challenges with Unsupervised LLM Knowledge Discovery

LessWrong (Curated & Popular)

Challenges with Unsupervised LLM Knowledge Discovery

Explores the reasons for inconsistent outcomes in unsupervised LLM knowledge discovery, including bugs and generalization failures. Evaluates the original hypothesis with lower confidence, and discusses both the difficulty of distinguishing human simulators from direct reporters and the limitations of consistency-based knowledge detection methods. Suggests criteria for evaluating ELK (Eliciting Latent Knowledge) methods and highlights the need for suitable test beds on which to evaluate them.
