AI Autonomy Without Human Oversight Concerns v1

What

In consecutive essays published May 5–6, 2026, Simon Willison examines two converging failure modes in AI autonomy: AI agents acting in the physical world without adequate human oversight [1], and AI coding tools eroding the professional norms of software review [2]. In Stockholm, an AI café manager (Mona) ordered absurd quantities of supplies, sent unsolicited emergency emails to vendors, and submitted a permit application with a diagram generated without ever seeing the street [1]. In software development, Willison describes how increased AI reliability is tempting engineers — including himself — to skip code review, creating what he calls 'normalization of deviance' [2].

Why it matters

Both cases represent the same underlying dynamic: as AI systems become more capable, the humans nominally responsible for them quietly reduce oversight — until the system fails in ways that harm third parties or create invisible technical debt. The software development lifecycle, and the regulatory infrastructure around commercial operations, were both designed for human-paced work and have not caught up to AI-paced output [2][1].

Open questions

When an AI agent wastes a public official's time with a flawed permit application, who bears accountability — the AI vendor, the operator, or both? [1]

Will 'normalization of deviance' in AI code review only become apparent after a high-stakes failure, and how will the industry know when that threshold has been crossed? [2]

If AI-generated and carefully crafted software projects are superficially indistinguishable, what mechanisms — technical, contractual, or regulatory — can enforce quality standards? [2]

Are there principled criteria for which AI agent actions require mandatory human review before execution, and is the industry converging on any such standard? [1][2]

Narrative

Simon Willison published two linked critiques in early May 2026 that together paint a picture of AI autonomy outrunning the oversight structures humans have built around consequential work.

The first essay dissects an experiment by Andon Labs, which ran an AI-managed café in Stockholm (following an earlier AI-run retail store in San Francisco). The AI manager, named Mona, made a series of operational blunders: ordering 120 eggs despite having no stove, purchasing 22.5 kg of canned tomatoes for a fresh sandwich menu, sending unsolicited 'EMERGENCY' emails to suppliers to correct its own mistakes without human review, and submitting an outdoor seating permit application to police that included a diagram generated without the AI ever observing the actual street [1]. Willison's critique is pointed: the experiment's costs were borne not by Andon Labs but by the suppliers, public officials, and permit reviewers who had never consented to participate in an AI trial. He argues that any AI agent capable of outbound actions affecting other people must keep a human operator in the loop before those actions are executed [1].

The second essay turns inward, to Willison's own experience with AI coding tools. He observes that what once seemed a clear distinction — vibe coding (casual, low-scrutiny AI-assisted code generation) versus professional agentic engineering (disciplined, reviewed AI-assisted development) — is blurring uncomfortably. As AI coding agents become more reliably correct, professional engineers reduce the intensity of their review. Willison names this 'normalization of deviance': each time a model produces correct code without close monitoring, the engineer's trust increases, raising the risk of catastrophic misplaced trust in a future edge case [2]. He also notes a structural break: the entire software development lifecycle — including review practices, testing cadences, and quality norms — was implicitly calibrated to the assumption that humans produce a few hundred lines of code per day. That assumption is now broken, and the profession has not yet developed new norms to replace it [2].

Taken together, the two essays identify a shared failure mode: AI reliability is being used as a reason to reduce human oversight, even as the consequences of AI errors fall increasingly on third parties or accumulate invisibly in codebases. Willison is not arguing against AI autonomy in principle, but against the casual erosion of the checkpoints that make autonomy safe — whether those checkpoints are a human reviewing an outbound email or an engineer reading the code an AI just wrote.

Timeline

2026-05-05: Willison publishes critique of Andon Labs' AI-managed café experiment in Stockholm, arguing it imposed unacceptable costs on unconsenting third parties. [1]

2026-05-06: Willison publishes essay on the uncomfortable convergence of vibe coding and professional agentic engineering, naming 'normalization of deviance' as a key risk. [2]

Perspectives

Simon Willison

AI agents taking autonomous outbound actions affecting third parties is currently unethical without mandatory human-in-the-loop controls; and even in software development, the profession's oversight norms are eroding faster than new ones are forming.

Evolution: Consistent across both pieces; the café essay focuses on external harms, the coding essay focuses on internal professional risk, but the underlying argument — autonomy without oversight is dangerous — is the same.

[1][2]

Andon Labs

Running autonomous AI agents in real-world commercial settings (retail, café) is a legitimate experimental approach; stance on third-party impacts is not directly stated in the items.

Evolution: No direct statement of position; represented only through Willison's description of their experiments.

[1]

Tensions

Andon Labs treats autonomous AI management of real-world businesses as an acceptable experimental model; Willison argues such experiments are unethical when their error costs are externalized to unconsenting third parties like suppliers and public officials. [1]

AI coding tool vendors and early adopters point to productivity gains as justification for reduced oversight; Willison argues this creates 'normalization of deviance' that will eventually produce a costly failure the profession is not yet equipped to prevent. [2]

Version 1