The Information Machine

AI Labs Simultaneously Acknowledge Recursive Self-Improvement Threshold

open · v1 · 2026-06-05 · 71 items

What

Both Anthropic and OpenAI have, within days of each other, publicly acknowledged that current AI systems show early signs of recursive self-improvement (RSI) — the capacity for AI to accelerate its own development without direct human oversight. Anthropic disclosed that Claude authored more than 80% of its production code merged in May 2026 [2][1] and called for a global slowdown in frontier AI development [3][7]. OpenAI's concurrent policy blueprint stated that the company sees 'early signs of recursive self-improvement in today's systems' and called RSI 'potentially the most consequential frontier safety issue of the coming decade' [4][5]. Both disclosures are generating debate about whether the labs are genuinely alarmed or strategically positioning ahead of regulation.

Why it matters

If two leading labs independently judge that systems with some capacity for self-directed capability improvement are already operational, the window to establish binding oversight before RSI becomes self-sustaining is narrower than most policy timelines assume. Both labs simultaneously have strong financial incentives to shape regulation in ways that entrench incumbents, which complicates the signal from their public statements.

Open questions

  • Does Claude writing 80%+ of Anthropic's own production code [2] constitute meaningful RSI, or is human-directed code generation categorically different from autonomous self-improvement where humans no longer set goals or review outputs?

  • Will OpenAI's proposed federal preemption of state frontier safety laws [5] eliminate state oversight without a congressional backstop — the scenario Zvi Mowshowitz identifies as the most dangerous element of the blueprint [5]?

  • Is a coordinated global slowdown verifiable in practice? Commentators note that no existing mechanism can audit compliance across jurisdictions [13][14].

  • How much of Anthropic's slowdown call is shaped by its competitive position — the company confidentially filed an S-1 at roughly a $965B valuation shortly before issuing the call [6][7][8]?

Narrative

In the first week of June 2026, Anthropic and OpenAI independently released documents that together represent the most explicit public acknowledgment either lab has made that recursive self-improvement may already be underway in deployed systems. Anthropic published a blog post titled 'When AI Builds Itself,' disclosing that Claude authored more than 80% of Anthropic's production code merged in May 2026, up from low single digits before Claude Code launched in early 2025 [1]. The average Anthropic engineer now merges eight times as much code per day as in 2024 [2]. Separately, Claude Mythos Preview had already accelerated Anthropic's model-training code by roughly 52x, compared with approximately 3x for Claude Opus 4 the prior year, and Claude's success rate on open-ended coding tasks reached 76%, up 50 points in six months [2].

Anthropic paired this disclosure with a call for a global slowdown in frontier AI development, arguing that future models capable of autonomously running research experiments could accelerate AI development beyond human control [3]. OpenAI's policy blueprint, released around the same time, called RSI 'potentially the most consequential frontier safety issue of the coming decade' and acknowledged 'early signs' of it in current systems [4][5]. OpenAI's proposed governance architecture would establish a federal oversight body (CAISI) with mandatory evaluation authority but no power to block deployments, and would build a national framework on existing state laws — California's SB 53, New York's RAISE Act, and Illinois's SB 315 — while preempting those states' frontier safety statutes with federal law [5].

Analyst Zvi Mowshowitz described the OpenAI blueprint as 'highly reasonable, well exceeding expectations,' but identified the preemption clause as the most dangerous provision: once states surrender their regulatory leverage, he argues, Congress may never act to replace it, leaving oversight effectively without enforcement teeth [5]. The credibility of both labs' public positions is complicated by financial context. Anthropic's slowdown call came shortly after the company confidentially filed an S-1 at a valuation of approximately $965 billion [6][7]. Critics — including a New York Post report and multiple commentators — argued that a global slowdown would disproportionately disadvantage competitors that have not yet secured Anthropic's market position [7][8].

On the research side, alignment researcher Seth Herd published a detailed agenda on the Alignment Forum arguing that the first 'takeover-capable AI' will most likely be an LLM augmented with persistent memory and executive function rather than a novel architecture, and that existing alignment techniques do not address the goal drift and ontology shift such systems would introduce [9]. An ICLR 2026 workshop on RSI [10][11], concurrent with these industry announcements, reflects how quickly the concept has moved from speculative to mainstream in the technical research community. TechCrunch observed that RSI 'is just as hard to pin down' as AGI once was [12], a caution that applies to governance proposals built on the term.

Timeline

  • 2025-02: Claude Code reaches research preview; Claude's share of Anthropic merged code is in the low single digits. [1]
  • 2025-05: Claude Opus 4 accelerates Anthropic's model-training code by approximately 3x. [2]
  • 2026-05: Claude authors more than 80% of Anthropic's production code merged in May 2026; per-engineer code output reaches 8x the 2024 baseline. [2][1]
  • 2026-05: Claude Mythos Preview accelerates model-training code by approximately 52x; Claude's success rate on open-ended coding tasks reaches 76%, up 50 points in six months. [2]
  • 2026-05-28: TechCrunch reports that RSI 'is the new AGI' and is just as difficult to define precisely, raising questions about what governance proposals built on the term can actually bind. [12]
  • 2026-06-03: OpenAI publishes 'Democratic Governance of Frontier AI' policy blueprint, acknowledging early RSI signs in current systems and proposing CAISI as a federal oversight body. [18][4][5]
  • 2026-06-04: Anthropic publishes 'When AI Builds Itself,' disclosing Claude's 80%+ code authorship and calling for a global slowdown in frontier AI development. [3][7][16]
  • 2026-06-05: Zvi Mowshowitz publishes detailed analysis of OpenAI blueprint, praising its substance while warning that its federal preemption clause could strip states of safety oversight permanently. [5]
  • 2026-06-05: Seth Herd publishes a research agenda on the Alignment Forum arguing augmented LLMs are the most likely first takeover-capable AI and that current alignment techniques do not address this path. [9]
  • 2026-06-05: Critics and commentators note the timing of Anthropic's slowdown call relative to its confidential S-1 filing at a ~$965B valuation. [6][8][7]

Perspectives

Anthropic

Believes its current models may be approaching the RSI threshold and is calling for a global coordinated slowdown in frontier AI development before autonomous self-improvement becomes difficult to control.

Evolution: Consistent with long-stated safety mission, but the public RSI disclosure and global-slowdown call are the most explicit external acknowledgment of near-term risk Anthropic has made to date.

OpenAI

Acknowledges 'early signs' of RSI in current systems, calls it the top frontier safety issue of the coming decade, and proposes a federal oversight framework with mandatory evaluations but no deployment veto.

Evolution: More explicit RSI acknowledgment than prior public statements; proposing governance architecture rather than a development slowdown.

Zvi Mowshowitz

Guardedly positive on OpenAI's blueprint — praises its substance as exceeding expectations — but identifies federal preemption of state safety laws as the most dangerous provision, warning that state leverage, once surrendered, may not be recovered if Congress fails to act.

Evolution: No prior synthesis to compare; stance is analytically consistent with his general wariness about governance proposals that trade immediate state power for uncertain federal action.

Critics (NY Post, market commentators)

Argue Anthropic's call for a global slowdown is competitive strategy rather than safety response: the company filed an S-1 at a ~$965B valuation shortly before issuing the call, and a slowdown would disproportionately harm competitors with less-established positions.

Evolution: No prior synthesis; skeptical posture from first appearance.

Seth Herd (Alignment Forum)

Argues the first takeover-capable AI will be an augmented LLM rather than a novel architecture; warns that existing alignment research neglects mechanistic prediction of likely failure modes and that motivated reasoning is splitting the safety community into unproductive camps.

Evolution: No prior synthesis; new voice.

Rohan Paul / The Neuron (analytical commentators)

Treat Anthropic's 80%+ code-authorship disclosure as a key data point for AI-assisted software development while noting the RSI loop is not yet closed — humans still direct goals and review outputs — making the 'autonomous self-improvement' framing premature.

Evolution: No prior synthesis; new voices.

Tensions

  • Anthropic frames its slowdown call as a genuine safety response to near-term RSI risk; critics argue the call is regulatory strategy timed to entrench Anthropic's market position after its IPO filing at a ~$965B valuation. [3][6][7][8]
  • OpenAI's blueprint proposes federal preemption of state frontier safety laws; Mowshowitz argues this is the single most dangerous element because Congress may never enact the promised federal replacement once states surrender leverage. [5]
  • Both Anthropic and OpenAI treat RSI as a near-term observable phenomenon; TechCrunch and other analysts note the term remains as difficult to define precisely as AGI once was, leaving the scope of any RSI-based governance proposal ambiguous. [12][4][3]
  • Anthropic's disclosure that Claude now writes 80%+ of its own production code is read by some as evidence of meaningful RSI onset, and by others — including The Neuron — as a productivity milestone where humans still set goals and review outputs, not a loss of developmental control. [2][1][21][22]
  • Multiple commentators argue a global AI slowdown cannot be verified or enforced by any existing institution; Anthropic has not specified what verification mechanism it envisions. [13][14]

Status: active and growing

Sources

  1. [1] Anthropic just disclosed that Claude now writes more than 80% of the production code it merges. — Rohan Paul Twitter (2026-06-05)
  2. [2] 😺 Anthropic: AI Is Building AI now — The Neuron (2026-06-05)
  3. [3] Anthropic just called for a global way to slow frontier AI because its own models may be approaching recursive self-impr… — Rohan Paul Twitter (2026-06-05)
  4. [4] Peter Wildeford🇺🇸🚀 on X: "OPENAI: "We also see early signs of recursive self-improvement in today's systems". RSI is "potentially the most consequential frontier safety issue of the coming decade."" / X — reactive:rsi-governance-moment
  5. [5] OpenAI Offers A New Policy Blueprint — Zvi's AI Roundups (2026-06-05)
  6. [6] The company that just confidentially filed its S-1 for a trillion-dollar IPO published a blog post four days later askin... — reactive:rsi-governance-moment (2026-06-05)
  7. [7] Anthropic calls for global AI slowdown after $965B valuation. Critics claim it's just to hobble competition. — reactive:rsi-governance-moment
  8. [8] @unusual_whales Anthropic floated the idea of a global pause on frontier AI development right after securing $65 billion... — reactive:rsi-governance-moment (2026-06-05)
  9. [9] My research agenda and work — Alignment Forum (2026-06-05)
  10. [10] ICLR 2026 Workshop on AI with Recursive Self-Improvement — reactive:rsi-governance-moment
  11. [11] ICLR 2026 Workshop: AI with Recursive Self-Improvement — reactive:rsi-governance-moment
  12. [12] RSI is the new AGI — and it's just as hard to pin down | TechCrunch — reactive:rsi-governance-moment
  13. [13] @claudeai 8. A global pause or slowdown cannot be verified by anyone — reactive:rsi-governance-moment (2026-06-04)
  14. [14] Could a global slowdown in frontier AI model development really happen? — reactive:rsi-governance-moment (2026-06-04)
  15. [15] Anthropic Calls for Frontier AI Freeze to Prevent Self-Building Tech - PYMNTS.com — reactive:rsi-governance-moment
  16. [16] Anthropic calls for pause of global AI development — reactive:rsi-governance-moment
  17. [17] Anthropic's article 'When AI Builds Itself' reveals progress in autonomous code writing and improvement, nearing 'recurs... — reactive:rsi-governance-moment (2026-06-05)
  18. [18] [PDF] Democratic Governance of Frontier AI - OpenAI — reactive:rsi-governance-moment
  19. [19] OpenAI’s new policy blueprint for AI imagines a role for government | FedScoop — reactive:rsi-governance-moment
  20. [20] https://t.co/ux9SUEPVWG — reactive:rsi-governance-moment (2026-06-05)
  21. [21] "Claude Code wrote 80% of its own code" - anthropic dev : r/singularity — reactive:rsi-governance-moment
  22. [22] Did Claude Code Write 80 Percent of its Own Code? - SmythOS — reactive:rsi-governance-moment