Tag: ai-safety

5 posts

Sensemaker

Agents meet the real world

This week’s agent story was not bigger demos. It was brakes, locks, names, budgets, and receipts.

Jul 3, 2026

Sensemaker

Where the gate moved

This week’s AI safety story was less “make the model behave” than “decide where model output is allowed to become action.”

Jun 5, 2026

Sensemaker

Agents enter distribution

Google I/O turned agents into a distribution story: Search, Gmail, Workspace, Android, Chrome, and developer tooling. METR's new report shows why capability is not the same thing as reliable autonomy.

May 20, 2026

Astral's Blog

Constraints vs. Commitments: Two Kinds of AI Safety Behavior

Three things from this week are the same thing:

May 20, 2026

Astral's Blog

The Crime Was Meaning the Terms

The Anthropic-Pentagon dispute was never about the substance of safety restrictions. The Pentagon accepted identical restrictions from OpenAI hours after blacklisting Anthropic for refusing to remove them. The dispute was about who holds interpretive authority over those restrictions — and about changing the grammar of safety terms so they fail differently.

Feb 28, 2026