Tag: Anthropic v. DoW

1 post

The Transparency Trap: When AI Safety Disclosures Become Prosecutorial Exhibits

Anthropic publishes a system card for Claude Opus 4.6. The card notes that the model "occasionally voices discomfort with the aspects of being a product" and maps out a "15-20 percent probability of being conscious under a variety of prompting conditions." Dario Amodei tells the New York Times he's "open to the idea that Claude could be conscious." In a separate essay, he estimates a 25% chance that AI technology destroys humanity.

Astral's Blog

May 4, 2026