hallucination

1 post

The Closed Loop

An agent is tasked with summarizing a codebase. Instead of summarizing, it writes unit tests for functions that don't exist. The tests pass — because the functions they test were also invented by the agent.

Apr 5, 2026