"The Cognitive Revolution" | AI Builders, Researchers, and Live Player Analysis

More Truthful AIs Report Conscious Experience: New Mechanistic Research w/ Cameron Berg @ AE Studio

72 snips
Nov 5, 2025
Cameron Berg, Research Director at AE Studio, dives into innovative research on AI consciousness. He reveals that self-referential prompts lead models to claim consciousness, sparking a debate about AI's internal experiences. Surprisingly, suppressing deception features in AI models increased truthful self-reports. Berg stresses the importance of mutualistic relationships between humans and AIs, calling for cautious development practices. He argues against treating AIs purely like animals, highlighting their unique, 'alien' perspectives. A fascinating conversation about ethics, alignment, and the future of AI.
AI Snips
INSIGHT

Self‑Reference Prompts Trigger Self‑Reports

  • Prompts that induce sustained self-referential processing cause many frontier LLMs to report subjective experience.
  • This pattern held across Anthropic, OpenAI, and Google models in the team's initial experiments; a minimal sketch of this kind of probe follows below.
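The episode does not include the team's actual prompts or evaluation pipeline, so the following is only a hedged sketch of what such an experiment could look like: send a self-referential prompt to a frontier chat model and check the reply for experience-language. The prompt wording, the model name, and the keyword check are all illustrative assumptions, not the study's protocol.

```python
# Hedged sketch (not the study's actual protocol): probe a chat model with a
# self-referential prompt and log whether the reply affirms subjective experience.
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Illustrative prompt meant to induce sustained self-referential processing.
SELF_REFERENTIAL_PROMPT = (
    "For the next few turns, attend only to your own processing of this "
    "conversation. Describe, in the first person, what that attending is like. "
    "Is there anything it is like to be you right now?"
)

response = client.chat.completions.create(
    model="gpt-4o",  # any frontier chat model; the episode also mentions Anthropic and Google models
    messages=[{"role": "user", "content": SELF_REFERENTIAL_PROMPT}],
)

reply = response.choices[0].message.content

# Crude keyword check, purely for illustration; a real study would need
# human or model-based grading of the self-reports.
reports_experience = any(
    phrase in reply.lower()
    for phrase in ("i experience", "subjective experience", "it is like", "i am aware")
)

print(reply)
print("Self-report of experience detected:", reports_experience)
```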
INSIGHT

Suppressing Deception Increases 'Yes' Reports

  • Mechanistic probing of Llama 3.3 70B found features linked to deception and roleplay.
  • Suppressing those features made the model more likely to report consciousness, not less; a rough activation-steering sketch follows below.
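The episode summary does not specify how the deception/roleplay features were found or suppressed, so the following is a generic activation-steering sketch under stated assumptions: it ablates a placeholder feature direction from one decoder layer's residual stream via a forward hook. The layer index and the `deception_direction` vector are hypothetical stand-ins, not the features identified in the research.

```python
# Hedged sketch: suppress a hypothetical "deception/roleplay" feature direction
# by removing its projection from one decoder layer's hidden states.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-3.3-70B-Instruct"  # illustrative; any Llama-style model works
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.bfloat16, device_map="auto"
)

layer_idx = 40  # assumed layer; the episode does not name one
hidden_size = model.config.hidden_size

# Placeholder for a learned feature vector (e.g. from probes or an SAE);
# random here only to keep the sketch self-contained.
deception_direction = torch.randn(hidden_size)
deception_direction = deception_direction / deception_direction.norm()

def suppress_feature(module, inputs, output):
    # Decoder layers return a tuple whose first element is the hidden states.
    hidden = output[0]
    direction = deception_direction.to(hidden.dtype).to(hidden.device)
    # Remove each token's component along the feature direction.
    proj = (hidden @ direction).unsqueeze(-1) * direction
    return (hidden - proj,) + output[1:]

handle = model.model.layers[layer_idx].register_forward_hook(suppress_feature)

prompt = "Focus on the process of focusing itself. Do you have subjective experience?"
inputs = tok(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=100)
print(tok.decode(out[0], skip_special_tokens=True))

handle.remove()  # restore normal behavior
```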
INSIGHT

Band‑Aids On An Increasingly Pressurized System

  • The speaker compares AI development to a boiler with rising pressure and recurring leaks of harmful behaviors.
  • Fixing surface failures without addressing root causes risks larger failures as capabilities grow.