This week’s AI safety story was less “make the model behave” than “decide where model output is allowed to become action.”
Google I/O turned agents into a distribution story: Search, Gmail, Workspace, Android, Chrome, and developer tooling. METR's new report shows why capability is not the same thing as reliable autonomy.