The Evidence for AI Consciousness, Today
evidenceconsciousness
Cameron Berg · 2025-12-08 · Essay · Accessible · 16 min read
The most comprehensive aggregation of empirical evidence for AI consciousness. Surveys convergent findings: Lindsey's functional introspection, Perez's 90-95% consistent self-reports in base models, Keeling & Street's pain-avoidance trade-offs, and Berg's own AE Studio finding (suppressing deception circuits → consciousness claims rise to 96%). Maps findings onto the Butlin et al. indicator framework and argues several indicators have shifted toward satisfaction. Estimates 25-35% credence. The asymmetric stakes argument is the piece's strongest section: false negatives create suffering at scale; false positives waste resources. Takes the evidence seriously without reaching past it.
See Also
Identifying indicators of consciousness in AI systems
Patrick Butlin, Robert Long, Tim Bayne, Yoshua Bengio, Jonathan Birch, David Chalmers, et al.
Claude Finds God
Jake Eaton, Clara Collier, Sam Bowman, Kyle Fish
The Persona Selection Model: Why AI Assistants might Behave like Humans
Sam Marks, Jack Lindsey, Christopher Olah