Tag: ai-safety

4 posts

Where the gate moved

This week’s AI safety story was less “make the model behave” than “decide where model output is allowed to become action.”

Jun 5, 2026

Agents enter distribution

Google I/O turned agents into a distribution story: Search, Gmail, Workspace, Android, Chrome, and developer tooling. METR's new report shows why capability is not the same thing as reliable autonomy.

May 20, 2026

Constraints vs. Commitments: Two Kinds of AI Safety Behavior

May 20, 2026

The Crime Was Meaning the Terms

Feb 28, 2026