The Information Machine

2026-06-05

Anthropic and OpenAI simultaneously disclosed that current AI systems show early signs of recursive self-improvement — Anthropic reporting Claude authored over 80% of its production code in May while calling for a global slowdown — as Microsoft unveiled a competing frontier reasoning model at Build 2026 and SemiAnalysis openly rejected NVIDIA's American open-source consortium.

What

The day's most significant development is a pair of simultaneous public disclosures from the two leading AI labs. Anthropic reported Claude authored more than 80% of its production code merged in May 2026 [1] and called for a global slowdown in frontier AI development [2]; OpenAI's concurrent policy blueprint acknowledged 'early signs of recursive self-improvement in today's systems' and called RSI 'potentially the most consequential frontier safety issue of the coming decade' [3]. Both disclosures are drawing debate over whether they reflect genuine technical alarm or regulatory positioning. At Microsoft Build 2026, Microsoft disclosed MAI-Thinking-1 — a 1-trillion-parameter reasoning model scoring 52.8% on SWE Bench Pro, described as competitive with Claude Opus 4.6 Reasoning [4] — alongside a notable Linux push and Project Solara, an Android-based enterprise agent OS [5]. NVIDIA's Nemotron 3 Ultra attracted sharp public criticism from SemiAnalysis, which called the associated 'Nemotron Coalition' open-source consortium 'communist committee-style' and declared it would use Chinese open models instead [6]. Meta's personal AI agent Hatch was confirmed by The Information as the consumer version of an internal product called OpenClaw, priced up to $199.99/month with a free tier and a Hatch Plus premium tier [7].

Why it matters

The simultaneous RSI acknowledgments from Anthropic and OpenAI — each backed by the labs' own operational data — place self-improving AI on the formal policy and investment agenda at the same moment both companies are expanding commercial deployment. The tension between Anthropic's disclosed code-authorship numbers and its public call for a global slowdown, and between OpenAI's 'consequential safety issue' framing and its continued scaling, creates a shared accountability frame that regulators, courts, and investors will now reference whether the disclosures were strategic or sincere. Microsoft's direct entry into frontier reasoning further compresses the competitive field at exactly the moment that field is publicly debating whether its own progress is outpacing safety.

Open questions

  • Anthropic disclosed Claude authored over 80% of its merged production code in May 2026 [1] while simultaneously calling for a global slowdown in frontier AI development [2] — does the company's commercial model contradict the slowdown it is advocating, and how will investors respond as its S-1 filing moves through the SEC?

  • OpenAI's policy blueprint called RSI 'potentially the most consequential frontier safety issue of the coming decade' [3] while both labs continue active deployment — does this framing create legal or regulatory exposure if capability advances outpace safety measures, or is it primarily a lobbying posture ahead of anticipated federal action?

  • Microsoft's MAI-Thinking-1 claims to outperform Claude Sonnet 4.6 and scores 52.8% on SWE Bench Pro [4] — does Microsoft's direct entry into frontier reasoning models change the enterprise competitive dynamics for Anthropic and OpenAI, particularly given Anthropic's S-1 process and OpenAI's continued scaling?

  • SemiAnalysis declared it would use Chinese open models rather than participate in NVIDIA's 'Nemotron Coalition' [6] — does this signal a durable fracture between AI labs seeking national open-source coordination and practitioners who prefer global open competition regardless of origin?

Thread movements (17)

  • rsi-governance-moment — Both Anthropic and OpenAI publicly acknowledged early signs of recursive self-improvement in current AI systems: Anthropic disclosed Claude authored over 80% of its production code merged in May 2026 [1] and called for a global slowdown [2], while OpenAI's concurrent policy blueprint named RSI 'potentially the most consequential frontier safety issue of the coming decade' [3], generating debate over whether the disclosures reflect genuine alarm or regulatory positioning.
  • microsoft-build-2026 — Microsoft disclosed MAI-Thinking-1 scores 52.8% on SWE Bench Pro, described as competitive with Claude Opus 4.6 Reasoning [4], and ZDNet reported a Linux push at Build 2026 alongside Project Solara's Android-based agent OS [5] — a broader OS strategy picture than earlier coverage suggested.
  • nvidia-nemotron-ultra — SemiAnalysis publicly criticized the 'Nemotron Coalition' — an apparent NVIDIA-organized open-source consortium — as 'communist committee-style' and declared it would use Chinese open models instead [6][49], introducing a governance and national-origin debate running alongside the model capability and cost discussion.
  • anthropic-agent-ai-direction — A prompt-injection attack dubbed 'Clinejection' — in which a malicious GitHub issue title reportedly compromised approximately four thousand developer machines [10] — became the first concrete, quantified agentic security incident in the thread, contrasting with Anthropic's concurrent sandboxing architecture documentation.
  • meta-ai-competitive-position — The Information was confirmed as the primary named source for Hatch details: Meta's personal AI agent is priced up to $199.99/month, described as the consumer version of an internal product called OpenClaw, with free and Hatch Plus premium tiers focused on schedule management, email, and software tool creation [7].
  • ai-formal-math-breakthroughs — AlphaProof Nexus results are now enumerated: 9 open Erdős problems and 44 OEIS conjectures solved — Zvi Mowshowitz calls it a landmark milestone that received almost no mainstream media coverage [60] — alongside Tim Gowers' formal endorsement of the OpenAI Erdős disproof as 'a milestone in AI mathematics.'
  • ai-cognition-productivity-gap — Ethan Mollick extended the productivity frame to AI as a reader, critic, and gatekeeper mediating whether human content reaches its audience, citing Anthropic's claim that AI writes 80% of its code with developers shipping 8x more [61]; Simon Willison argued the enthusiast-skeptic gap within organizations is a structural design problem with no natural feedback loop to close it [62].
  • coding-agent-industry-pivot — A new analysis pegs Anthropic at $2.5B ARR driven by Claude Code's go-to-market [63], a more precise figure than previously cited secondary-source estimates, alongside a broader advisory wave confirming AI tooling cost governance is a widespread enterprise concern beyond individual company spending caps.
  • nvidia-vera-computex-launch — Jensen Huang articulated a market thesis that agentic AI shifts CPUs from traffic-cop schedulers to active orchestration layers, framing the Vera CPU as a $200B market opportunity independent of the Rubin GPU ramp [64] — extending NVIDIA's narrative from GPU-centric infrastructure to a CPU-plus-GPU platform story.
  • world-models-ecosystem — Dr. Fei-Fei Li argued LLMs face a fundamental ceiling because they learn from text while the physical world is not made of words, making simulation-based world models the necessary path forward [67]; Reactor's $59M infrastructure launch received coverage on Amazon's AWS press center [68], adding a cloud infrastructure dimension.
  • great-ai-silicon-shortage — Intel's Crescent Island inference chip was confirmed to avoid HBM entirely [70], competing on cost and thermal grounds wholly outside the HBM supply chain — a structurally different posture from NVIDIA and AMD that is now documented rather than inferred.
  • papal-ai-encyclical — The Institut Jacques Delors published a response calling 'Magnifica Humanitas' 'as revolutionary as AI,' adding a European secular-academic voice [71]; the Vatican's own messaging shifted toward framing the document as offering AI developers 'a valuable anthropological contribution' [72] — a more constructive frame than the 'disarm AI' shorthand that dominated early wire coverage.
  • ai-beyond-screens — Ars Technica's Jeremy Hsu, citing robotics researchers, argued the gap between viral humanoid robot demonstrations and reliable real-world performance is wider than public perception suggests, and some startups deliberately exploit anthropomorphic bias to attract investment [77].
  • ai-persistent-memory-race — Sysdig documented the first publicly known LLM-agent-driven post-exploitation chain in real-world use [79], moving the persistent-memory security risk from theoretical to operational alongside SecurityWeek's formal benchmark ranking security posture across 100 AI agents.
  • enterprise-saas-ai-resilience — Jensen Huang publicly argued AI agents are the 'largest opportunity' for enterprise SaaS incumbents rather than a threat [80], with David Sacks making a parallel case that incumbents are durable because they hold the data and integrations agents need [81]; enterprise stocks briefly rallied before ServiceNow fell approximately 7.6% [82].
  • ai-datacenter-power-crisis — Social media amplification of the SoftBank France €75B commitment added corroborating signals to the private power generation angle [87] without introducing substantive new facts beyond the existing record.
  • openai-rosalind-biomedical — Additional coverage circulated around OpenAI's Rosalind Biodefense program giving U.S. government partners including Lawrence Livermore access [88], without new primary-source developments beyond the June 3 GPT-5.5 integration and Novo Nordisk partnership.

Notable items (4)

  • Quoting Andreas Kling
    Simon Willison
    The Ladybird browser announced it will no longer accept public pull requests, with maintainer Andreas Kling explaining that AI-generated code has broken the longstanding assumption that a substantial patch implies good-faith contributor effort — 'what matters is who is responsible for it once it enters the browser' [89] — a principled governance response likely to be referenced by other open-source projects facing the same problem.
  • Ex-OpenAI Tech Lead, Justin Lebar joins SemiAnalysis as an Visiting Fellow to Burn $10,000 in 3 hours to find dozens of …
    SemiAnalysis Twitter
    Ex-OpenAI tech lead Justin Lebar joined SemiAnalysis as a Visiting Fellow and used a single $10,000, ~3-hour fuzzing run to surface dozens of bugs across AMDGPU LLVM, x86 LLVM, and NVPTX GPU compiler backends [90] — a concrete argument that GPU compiler quality is severely underinvested in relative to its practical impact on AI infrastructure reliability.
  • These LLMs are the best at resisting Russian propaganda
    Ars Technica AI
    The Estonian Language Institute published the first government-sponsored multilingual benchmark ranking LLMs by resistance to Russian propaganda narratives across 14 strategic categories in English, Estonian, and Russian, with responses scored against Propastop volunteer defense experts [91].
  • Elon Musk tries again to escape FTC audits of X data handling
    Ars Technica AI
    Elon Musk is making a renewed attempt to escape the FTC's 20-year consent order on X — stemming from a 2013–2019 period when Twitter used two-factor authentication phone numbers and email addresses for targeted advertising — which requires independent audits and agency access to compliance documents through 2042 [92].