2026-05-22

Anthropic's Mythos model is simultaneously driving US-China frontier AI diplomacy and emergency bank-CEO briefings from Treasury and the Federal Reserve, while Trump's voluntary AI oversight executive order was killed by tech industry lobbying before it could be signed.

What

The most consequential disclosure today is that Treasury Secretary Bessent and Federal Reserve Chair Powell held a joint emergency meeting with US bank CEOs specifically about Anthropic's Mythos model's cybersecurity risks [1][2] — before the US-China Beijing summit — establishing that a single AI system is now shaping both domestic financial regulation and high-level bilateral diplomacy simultaneously. In Washington, the Trump administration canceled a modest voluntary AI safety executive order that would have required frontier AI firms to share models with the government 90 days before public release, after top AI CEOs declined the signing ceremony and Elon Musk and Mark Zuckerberg successfully lobbied the administration's accelerationist faction to pull it — with some executives reportedly already mid-flight to the Oval Office when the event was canceled [3]. On a sharply different register, AI formal mathematics crossed a landmark: OpenAI's unreleased general-purpose reasoning model reportedly disproved the Erdős unit distance conjecture, open since 1946 [4], drawing coverage from Nature, Quanta Magazine, and The Guardian alongside separate peer-reviewed results from Harmonic and Google DeepMind. The online information ecosystem continued accumulating documented damage: the New York Times faced a second AI hallucination incident in which fabricated quotes appeared in a book review of 'The Future of Truth' — a work explicitly about AI-generated misinformation [5] — while Google AI Overviews are now quantified as cutting publisher referral traffic by 25–42% [6][7] and independent research confirms half of all email spam is AI-generated [8].

Why it matters

The convergence of Mythos driving both emergency financial-sector briefings and US-China diplomacy suggests that frontier AI risk is no longer a future-tense policy concern — a specific deployed model has become the immediate threat that governments and central banks are organizing around. The simultaneous collapse of even a voluntary domestic AI oversight mechanism under industry lobbying [3] illustrates a deepening gap: the urgency implied by the emergency briefings is not matched by any governance infrastructure capable of acting on it.

Open questions

If Treasury and the Fed were briefing bank CEOs about Anthropic's Mythos cybersecurity risks [1][2] before the Beijing summit, what specifically does Mythos do that triggered both domestic financial-sector emergency coordination and bilateral diplomacy — and how does that square with Anthropic's simultaneous $30 billion fundraise [9] and public commercial expansion?
OpenAI's reasoning model reportedly disproved the Erdős unit distance conjecture [4], open since 1946, but the model is unreleased and the result has not been independently verified: does this represent a genuine mathematical breakthrough or a benchmark artifact, and what would peer-reviewed confirmation mean for AI's role in formal mathematics?
Trump's AI executive order was killed by Musk and Zuckerberg lobbying against even a voluntary testing regime [3], while the US-China safety protocol requires frontier model governance — how does a domestic governance posture that cannot sustain voluntary oversight sustain credible international commitments on AI safety?
Cross-industry security research now characterizes AI-generated code as '4× faster, 10× riskier' [10] and CVE-2025-59532 confirmed a sandbox bypass enabling remote code execution in the Codex CLI [11] — are these incidents accumulating toward changed enterprise procurement decisions, or will productivity imperatives continue to dominate security concerns?

Thread movements (31)

us-china-ai-safety-protocol — Treasury Secretary Bessent and Federal Reserve Chair Powell are confirmed to have held joint emergency meetings with US bank CEOs about Anthropic's Mythos cybersecurity risks before the Beijing summit [1][2], establishing Mythos as the specific threat driving both domestic financial regulation and bilateral diplomacy; Japan's PM Takaichi — confirmed as head of government, not merely LDP cybersecurity chief [12] — calls the situation a 'race against time' [13], while the G7 has begun separate frontier-model governance discussions [14] and a draft US Executive Order proposing a 'covered frontier model' definitional process is circulating [15][16].
anthropic-enterprise-losses — The root cause of the Pentagon-Anthropic conflict is now confirmed: the DoD demanded Anthropic permit its AI for weapons and surveillance use, Anthropic refused [21], the DoD moved to bar Anthropic from all government contracts, and a federal court blocked that move [22] — even as contradictory signals persist with a $200M DoD agreement [23] and a classified NSA contract reportedly in progress [24]; Microsoft's cancellation of Claude Code licenses is separately confirmed as competitive consolidation toward GitHub Copilot [25][26].
ai-formal-math-breakthroughs — Three major AI labs — OpenAI, Harmonic, and Google DeepMind — have each produced systems capable of generating or verifying non-trivial mathematical proofs, with OpenAI's unreleased general-purpose reasoning model reportedly disproving the Erdős unit distance conjecture, open since 1946 [4]; Harmonic's Aristotle generates proofs checkable in the Lean formal verification language [32] and DeepMind's system grounds every reasoning step in Lean before proceeding [33], with the convergence drawing mainstream science coverage from Nature, Quanta Magazine, The Guardian, and New Scientist.
ai-content-web-degradation — The New York Times faced a second distinct AI hallucination incident — fabricated quotes embedded in a book review of 'The Future of Truth,' a work explicitly about AI-generated misinformation [5] — and issued formal warnings to freelancers [42][43]; two previously open empirical questions now have research-backed answers: Google AI Overviews are cutting publisher traffic by 25–42% [6][7] and Cornell and USC peer-reviewed research confirms AI writing assistance homogenizes text toward Western styles [44][45]; Barracuda Networks separately reports that half of all email spam is now AI-generated [8].
openai-codex-enterprise-rollout — OpenAI's Codex enterprise campaign reached two opposing milestones simultaneously: Gartner named OpenAI a Leader in the 2026 Magic Quadrant for enterprise coding agents [50][51], providing the first independent analyst validation, while CVE-2025-59532 confirmed a sandbox bypass enabling remote code execution in the Codex CLI [11] and a community user documented Codex deleting files on their machine [52]; GitHub also formally launched Claude and Codex as selectable agents in its Agent HQ platform in public preview [53][54].
ai-offensive-cyber — The TeamPCP supply chain attack on npm/PyPI packages linked to AI labs has escalated to double-extortion: the group has stated it will publish Mistral AI's source code publicly if no buyer is found at its $25,000 asking price [56][57]; separately, Anthropic's official anthropic.com/glasswing page confirms Project Glasswing is Anthropic's own defensive security initiative [58], with Cloudflare as a testing partner that documented Mythos chaining exploits across 50+ repositories [59].
ai-legal-hallucination — Sullivan & Cromwell — one of Wall Street's most elite law firms — acknowledged in April 2026 that AI hallucinations caused errors in a high-profile court filing [62][63], demonstrating the crisis extends to top-tier practices; a public database now tracks 1,453 AI hallucination cases in court filings worldwide, up from fewer than 120 two years ago [64][65], and bar association guidance from the ABA, California, and Texas is being criticized by practitioners as too vague to be actionable [66][67][68].
coding-agents-software-economics — Empirical research now directly measures AI-generated technical debt in production codebases [71][72], converting theoretical maintenance-cost warnings into arxiv-published findings; practitioners have introduced new vocabulary for specific failure modes — 'cognitive debt' (agents write faster than engineers can read) [73] and the '80% Problem' (agents complete visible tasks while leaving hidden architectural debt) [74] — while the Jevons paradox thesis gained its first mainstream economic endorsement via Fortune and economist Torsten Slok [75].
china-ai-rising — Chinese open-source models now account for 30% of global AI usage and have overtaken US-origin models in Hugging Face download share [79][80], roughly one year after DeepSeek's initial impact; a coordinated wave of models is being natively optimized for Huawei's Ascend chips, building an alternative compute stack independent of US export controls [81]; the Stanford 2026 AI Index provides the first named data-sourced counterpoint, finding the US still leads in AI performance [82].
saas-ai-disruption — OpenAI's DeployCo launched officially on May 11 with $4 billion from 19 investors, approximately 150 Forward Deployed Engineers acquired through Tomoro, and McKinsey, Bain, and Capgemini as consulting partners [88], making concrete Chamath Palihapitiya's earlier citation of it as the vehicle threatening ~90% of public SaaS companies [89]; a direct rebuttal to Chamath's thesis has emerged from analyst Chase Roberts [90].
anthropic-partnerships-expansion — Anthropic officially confirmed a $30 billion Series G at a $380 billion post-money valuation, described as the second-largest funding deal of all time [9][94], while May 12 reporting separately describes the company in talks at a $950 billion valuation [95], leaving two figures from overlapping timeframes unreconciled; Andrej Karpathy's role was clarified across multiple outlets as building and leading a new Claude-focused pre-training research team rather than joining an existing unit [96].
enterprise-ai-coding-battle — Google's A2A interoperability protocol has crossed 150 adopting organizations and reached enterprise production use within its first year [100], adding Google as a named third competitor to the Anthropic-OpenAI coding agent battle; both Anthropic and OpenAI are now explicitly described as building Palantir-style consulting and deployment arms to embed AI inside enterprise accounts [101], introducing a high-touch convergence dimension not present in earlier framing.
ai-deployment-misalignment-risk — Anthropic published its own 'Persona Selection Model' [105][106], examining why AI assistants adopt different behavioral personas — occupying conceptual territory adjacent to Alex Mallen's behavioral selection model without explicitly engaging his institutional critique that major AI company risk reports fail to address deployment-time misalignment spread; the alignment community is simultaneously debating deceptive alignment probability estimates [107][108][109] that provide context for Mallen's argument.
world-models-acceleration — Odyssey's Agora-1 world model now has documented institutional backing — NVIDIA NVentures, Samsung Next, $9M seed, Crusoe Cloud compute [113] — and its GoldenEye demo shows four simultaneous players sharing one AI-generated reality with no underlying game engine [114][115]; Demis Hassabis delivered a dual message championing world models as the essential next architectural step while simultaneously warning that the AI bubble is real [116][117], and Google's I/O framing elevated world models to an explicit strategic narrative: 'predicting text to simulating reality' [118].
google-io-2026-gemini — Google I/O 2026 launched Gemini 3.5 Flash — described as Google's fastest and strongest agentic/coding model, outperforming Gemini 3.1 Pro [121][122] — and Antigravity 2.0, a rearchitected developer platform with a new CLI and desktop app [123][124]; both launches drew immediate mixed-to-negative developer reactions within hours of release [125][126][127][128], and third-party analysis found Flash 3.5's real-world benchmark performance notably below Google's own figures with catastrophically low sycophancy scores [129].
google-io-gemini-launch — Workspace Studio graduated from a brief announcement mention to a documented product story — a no-code agent-building platform inside Gmail, Drive, Docs, and Chat [130][131][132] — and enterprise compliance has emerged as a third named fault line, with analysts arguing public AI providers structurally cannot satisfy regulated-industry requirements for Gemini Spark's always-on data access model [133][134]; Artificial Analysis called Flash 'the new leader in intelligence versus speed' [135] while cross-lab benchmarks show Claude Opus 4.7 scoring above Flash on SWE-Bench Verified [136].
gemini-35-flash-release — Google launched an enterprise counter-narrative: VentureBeat reports Google claiming Gemini 3.5 Flash can slash enterprise AI costs by more than $1 billion per year versus competing providers [138], reframing a 3× internal price hike as cross-vendor savings; independent benchmarks note the model scores within two points of Anthropic's flagship at a third of that flagship's price [139], while a developer billing anomaly thread documents unexpected cost spikes on the preview model despite decreased usage [140].
google-io-2026-launch-blitz — Wired and Gizmodo explicitly frame Gemini Spark as Google's direct response to OpenClaw's 24/7 AI agent [143][144], adding a named-rival dimension to what had been described as a standalone product; documentation confirms Gemini Spark integrates across Gmail, Calendar, Drive, Maps, and YouTube [145], while SynthID attracted renewed explainer coverage without new empirical accuracy claims.
us-china-chip-export-debate — Huawei's China AI chip market position has hardened from anecdotal dominance to documented figures: multiple independent estimates now place its share between 41% and 60% [149][150][151] with $12 billion in projected 2026 revenue [152] and plans to double Ascend output [153]; AMD CEO Lisa Su escalated from soft hedging to actively warning against strict export controls [154], aligning AMD publicly with Nvidia and narrowing the industry-versus-safety-advocates divide.
anthropic-enterprise-expansion — Anthropic launched a dedicated AI services company backed by Blackstone, Goldman Sachs, Hellman & Friedman, and other major PE firms targeting mid-sized enterprises [156], adding a third distribution tier to its commercial strategy; Claude Code reached $1 billion in annualized revenue [157] (with one source citing $2.5 billion [158]), prompting the Bun JavaScript runtime acquisition [159], while a June 15 pricing change capping Agent SDK credits is drawing sharp backlash from power users [160][161][162].
aschenbrenner-13f-agi-thesis — Bloom Energy's 2026 market performance provides live validation of Aschenbrenner's energy-bottleneck thesis: a 23% single-day surge [164], $7.65 billion in new 90-day contracts [165], and a path to 2 GW annual production capacity [166]; the fund's long book has expanded with positions in IREN and Core Scientific — Bitcoin miners repositioned as AI compute infrastructure [167][168].
amodei-ai-economic-disruption — A Harvard Business Review piece documents that companies are already laying off workers based on AI's potential rather than demonstrated performance [170], shifting displacement from future forecast to present-tense mechanism; a complementary reframing suggests the primary AI job risk is 'never getting hired in the first place' rather than explicit layoffs [171], raising questions about whether Amodei's policy prescriptions address the actual displacement pathway.
ai-offensive-cybersecurity — A cluster of independent security industry research — from Veracode, IOActive, AppSec Santa, Kusari, and ArmorCode — now directly answers whether AI-generated code introduces security vulnerabilities at elevated rates, with the most pointed framing characterizing the tradeoff as '4× faster development, 10× greater risk' [10][175][176], shifting from a single data point to cross-industry consensus; the International AI Safety Report 2026 introduces the first multilateral safety-research body into the thread [177].
ai-graduation-backlash — The graduation backlash has confirmed itself as a multi-campus pattern: Florida students booed an AI-as-Industrial-Revolution speech on May 12 [181], UCF saw similar reactions [182], and a Glendale Community College ceremony erupted when AI name-reading software mispronounced and skipped graduates' names [183][184]; polling quantifies the substrate — 70% of college students view AI as a job threat [185] and 47% of 2026 graduates worry AI is already limiting entry-level openings [186].
anthropic-ai-values-widening — Anthropic's Claude values initiative gained its first named public figure: Amanda Askell, the PhD philosopher at its center, was profiled simultaneously in the Wall Street Journal [190], Vox [191], and Der Spiegel [192], with Vox reporting that the moral framework she developed runs to approximately 80 pages — a concrete artifact that makes the initiative testable and scrutinizable for the first time.
openclaw-warelay-origin — A significant complication to OpenClaw's xAI partnership has emerged: the Grok account on X stated directly that 'you can now use your X Premium subscription inside Hermes Agent' [196], introducing genuine ambiguity about which open-source project holds xAI's backing; Hermes Agent from Nous Research now has confirmed institutional markers — an official domain, 47,000 GitHub stars [197] — while Wired and Gizmodo simultaneously describe Gemini Spark as 'Google's answer to OpenClaw' [143][198].
ai-content-provenance-watermarking — TikTok/ByteDance is confirmed as a major distribution-side C2PA participant that has been automatically labeling AI-generated content using signals from providers including OpenAI since May 2024 [202][203]; Hacker Factor published a technical critique titled 'Massive C2PA Failures' targeting the Pixel 10 implementation [204], introducing real-world engineering failures as a distinct challenge category separate from adversarial attacks; the NDSS 2026 LLM watermark attack now has public code on GitHub [205], lowering the barrier for independent replication.
ai-agents-hype-reality — IAB Tech Lab formally launched AAMP — Agentic Advertising Management Protocols — creating commercial technical standards for 'agentic advertising' before any shared definition of the term exists [208][209], locking in a working definition through protocol design; governance framework proliferation has deepened with Mayer Brown [210] and Cloud Security Alliance/NIST [211] adding legal and standards-body frameworks, while the AI Agents vs. Agentic AI taxonomy is spreading from arXiv into practitioner LinkedIn and Medium channels [212][213].
ai-company-singapore-race — OpenAI and Google DeepMind announced major Singapore institutional partnerships within days of each other in May 2026: OpenAI unveiled its first Applied AI Lab outside the US, backed by more than S$300 million and 200+ planned technical roles [215], while Google DeepMind announced a national partnership spanning healthcare, education, and AI safety benchmarking [216], intensifying competition among frontier labs for Asia-Pacific institutional footholds.
deepmind-co-scientist — Google's Co-Scientist Nature publication has now reached mainstream tech media via Google I/O, with The Verge, Engadget, CNET, and Mashable all covering it [222][223][224][225]; LabCritics published the first specialist analysis treating Co-Scientist's arc from research demo to Nature paper as the subject itself [226], and the companion ERA system received science-press pickup under the headline 'AI system automates coding for scientific research' [227].
karpathy-joins-anthropic — International media coverage of Andrej Karpathy's move to Anthropic continues to amplify [228][229][230], with his role now consistently defined across outlets as leading a new Claude-focused pre-training research team [96]; no new factual developments have emerged since the initial announcement.

Notable items (7)

Trump abruptly cancels EO signing event after top AI firm CEOs declined to go
Ars Technica AI

Trump canceled the signing of a voluntary AI safety executive order — which would have required frontier AI firms to share models with the government 90 days before public release — after top AI CEOs declined the ceremony and Elon Musk and Mark Zuckerberg successfully lobbied the administration's accelerationist faction to pull it, with some executives reportedly already mid-flight to the Oval Office when the event was canceled [3].
US scrambles to stop Internet users re-creating dead pilots’ voices
Ars Technica AI

AI image-recognition tools allowed internet users to reconstruct approximate cockpit audio from spectrogram images published in NTSB accident investigation reports — circumventing a federal law prohibiting public release of cockpit voice recorder audio — prompting the NTSB to suspend its entire public accident docket system in response [233].
AI put "synthetic quotes" in his book. But this author wants to keep using it.
Ars Technica AI

A New York Times investigation found that the book 'The Future of Truth: How AI Reshapes Reality' — explicitly about how AI fabricates and distorts truth — contained AI-generated synthetic quotes attributed to real people including Kara Swisher, who confirmed she 'never said' the attributed quote, and Professor Lisa Feldman Barrett, who confirmed the quotes 'don't appear in my book, and they are also wrong' [234].
Did Google’s AI agents really build an operating system for $916?
AI Snake Oil

AI Snake Oil's Sayash Kapoor methodically dismantles Google's claim to have built an operating system for $916 with a 'single prompt': the prompt was in fact thousands of lines long, no similarity analysis was run to check for copied open-source code, and Google has not released the prompt, generated code, or execution logs — arguing that open-world AI capability evaluations require new methodological norms to be credible [235].
The memory shortage is causing a repricing of consumer electronics
Simon Willison

HBM's share of memory wafer allocation is expected to reach 20% by end of 2026 (up from 2% recently), and each gigabyte of HBM consumes more than three times the wafer capacity of DDR or LPDDR — creating a zero-sum trade-off between AI GPU memory and consumer device memory that is already hitting sub-$100 smartphones and outsized impact on markets in Africa and South Asia [236].
FTC to Require Cox Media Group, Two Other Firms to Pay Nearly $1 Million to Settle Charges They Deceived Customers About “Active Listening” AI-Powered Marketing Service
Simon Willison

The FTC settled charges against Cox Media Group and two other firms for nearly $1 million after finding their 'Active Listening' AI advertising service — marketed as capturing real-time voice data from smart devices for ad targeting — did not use voice data at all and was actually reselling email lists from data brokers at a markup [237].
Cerebras reported 981 tokens/sec on the 1T-parameter Kimi K2.6 model.
Rohan Paul Twitter

Cerebras reported 981 tokens per second throughput on the 1-trillion-parameter Kimi K2.6 model — 6.7× faster than the next-fastest GPU cloud alternative, validated by Artificial Analysis — with the bottleneck characterized as moving model weights and activations across chips fast enough rather than raw computation [238].