The Information Machine

OpenAI Model Disproves 80-Year-Old Erdős Geometry Conjecture · history

Version 3

2026-05-23 04:29 UTC · 106 items

What

An OpenAI general-purpose reasoning model has disproved the planar unit distance conjecture — an 80-year-old open problem first posed by Paul Erdős in 1946 — by constructing a counterexample [1]. A formal preprint appeared on arXiv within days [2][3], and prominent mathematicians including combinatorialist Gil Kalai have endorsed the result via X and his blog [7][8]. The Guardian and Scientific American have characterized it as AI's most significant mathematical achievement to date [9][10]. A persistent credibility question now runs alongside the acclaim: multiple sources document a prior episode in which OpenAI secretly funded the FrontierMath benchmark before achieving records on it, a pattern mathematician Michael Harris has explicitly labeled a 'scandal' [15][14][16].

Why it matters

Settling an 80-year-old open problem through a novel cross-domain method — algebraic number theory applied to plane geometry — suggests that general-purpose reasoning models may be capable of original mathematical discovery rather than assisted search or verification. The accompanying credibility question matters equally: if OpenAI's math achievement claims cannot be independently verified, the field lacks the trust infrastructure needed to build on these results.

Open questions

  • Has the arXiv preprint entered formal peer review, and does the broader combinatorics and discrete geometry community independently accept the counterexample? [2][3]

  • Which specific OpenAI model was used, at what scale of test-time compute, and how much human problem-framing or scaffolding shaped the result — details absent from the public announcement? [1]

  • Does the algebraic number theory approach generalize to other open problems in discrete geometry, or is it structurally specific to this conjecture? [4][2]

  • Given that mathematician Michael Harris and multiple tech outlets are now framing OpenAI's FrontierMath benchmark involvement as a 'scandal' [15][13][16], will this result receive the independent mathematical scrutiny needed to establish lasting credibility beyond the arXiv preprint?

Narrative

On May 20, 2026, OpenAI announced that one of its general-purpose reasoning models had disproved the planar unit distance conjecture in discrete geometry, a problem first posed by Paul Erdős in 1946 [1]. The model produced a counterexample rather than a proof, establishing the conjecture is false. OpenAI framed the outcome as a landmark in AI-driven mathematics; its published announcement provided no caveats but also little methodological detail [1]. Within approximately one day, a preprint titled 'Remarks on the disproof of the unit distance conjecture' appeared on arXiv, providing the first publicly accessible formal documentation of the mathematics underlying the result [2][3].

The method the model used has attracted sustained attention. Early commentary identified a surprising bridge between algebraic number theory and plane geometry as the central innovation — treating the approach as more significant than the bare fact of disproving an 80-year-old conjecture [4][5]. This framing was amplified widely: Alex Dimakis, a researcher in machine learning and information theory, described the result as a breakthrough in 'a very famous Combinatorics problem' [6]. Gil Kalai — a prominent combinatorialist whose blog closely follows open problems in this area — explicitly endorsed the achievement via both his WordPress blog and on X, calling it 'amazing' and crediting AI [7][8]. Po-Shen Loh, Zvi Mowshowitz, and Greg Brockman also weighed in publicly, and mainstream press including Scientific American and The Guardian characterized it as AI's biggest mathematical breakthrough yet [9][10][11][12].

A structural credibility question runs alongside the acclaim. Multiple sources have surfaced a prior episode in which OpenAI secretly funded the FrontierMath benchmark before its o3 model achieved records on it — a conflict of interest that TechCrunch reported resulted in o3 scoring lower on independent assessments of that benchmark than OpenAI's initial announcements implied [13][14]. Mathematician and blogger Michael Harris — author of 'Mathematics without Apologies' and the Silicon Reckoner substack — has written about this episode under the explicit framing of 'The FrontierMath scandal' [15], and TechRepublic has separately documented benchmark discrepancies in OpenAI's math claims [16]. These prior episodes do not directly impeach the unit distance conjecture result, which is grounded in an arXiv preprint subject to mathematical scrutiny [2], but they establish a pattern of concern around evaluator independence in OpenAI's self-announced mathematical achievements.

The result arrives during a broader wave of AI progress in mathematics already well underway heading into mid-2026. Quanta Magazine had published 'The AI Revolution in Math Has Arrived' in April 2026 [17], and DeepMind's Gemini Deep Think program is separately pursuing AI-accelerated scientific discovery [18]. UC Irvine and USC received a $2.6 million DARPA grant for AI-driven mathematics as recently as May 18, 2026 [19]. Against this backdrop, OpenAI's unit distance result is both singular and part of a trend — making the question of independent peer review of the arXiv preprint particularly consequential for establishing how much weight the broader community will place on it.

Timeline

  • 1946: Paul Erdős first poses the planar unit distance conjecture in discrete geometry [5]
  • 2026-04: Quanta Magazine publishes 'The AI Revolution in Math Has Arrived,' establishing that AI mathematical capabilities are already a recognized trend [17]
  • 2026-05-18: UC Irvine and USC announced a $2.6 million DARPA grant for AI-driven mathematics breakthroughs [19]
  • 2026-05-20: OpenAI announces a general-purpose reasoning model has disproved the unit distance conjecture via counterexample [1]
  • 2026-05-21: ArXiv preprint 'Remarks on the disproof of the unit distance conjecture' appears; mathematician Gil Kalai endorses the result via X and his blog; Alex Dimakis's commentary is widely retweeted; The Guardian and Scientific American publish mainstream coverage [2][3][7][8][6][9][10]
  • 2026-05-21: Po-Shen Loh, Zvi Mowshowitz, and other named commentators weigh in; Reddit r/math and Hacker News host extended discussion [11][12][26][27]
  • 2026-05-22: Continued broad amplification across social media and science press; William Jin notes the result is 'monumental' but not AGI; Michael Harris's 'FrontierMath scandal' framing and TechCrunch/TechRepublic benchmark discrepancy reporting surface as background credibility context [25][15][14][16][13]

Perspectives

OpenAI

Presents the disproof as a landmark milestone in AI-driven mathematics; provides no methodological caveats or detail in the public announcement

Evolution: consistent

Gil Kalai (mathematician, combinatorics blogger)

Enthusiastically endorses the result via both X and his WordPress blog, calling it 'amazing' and crediting AI directly; his engagement from inside the relevant mathematical community lends the result significant credibility

Evolution: consistent — direct sources for his X post and blog now confirmed

Michael Harris (mathematician, Silicon Reckoner blogger)

Has explicitly framed OpenAI's prior FrontierMath benchmark involvement as a 'scandal,' raising structural concerns about evaluator independence in OpenAI's math achievement claims; a credentialed mathematical voice applying skeptical pressure

Evolution: new voice this pass

Alex Dimakis (ML/information theory researcher)

Frames the result as a breakthrough in combinatorics; his original post became the most widely retweeted expert commentary on the result

Evolution: consistent

Scientific American / The Guardian

Characterize the result as AI's biggest or most significant mathematical breakthrough yet; quote mathematicians expressing amazement

Evolution: consistent

TechCrunch / TechRepublic / Search Engine Journal

Tech press independently documenting the FrontierMath benchmark controversy: o3 scored lower on independent assessments than OpenAI initially implied, and OpenAI secretly funded the benchmark before achieving records on it

Evolution: new voice this pass

William Jin (@WilliamJin06)

Measured enthusiasm: calls the result 'monumental' but explicitly distinguishes it from AGI, signaling a preference for calibrated rather than maximalist interpretation

Evolution: consistent

Rohan Paul (@rohanpaul_ai)

Bullish; reads the result as evidence that test-time compute on a general-purpose model is sufficient for research-grade output without specialized architecture

Evolution: consistent

Zvi Mowshowitz

Cautiously impressed; calls this the first AI math result he finds genuinely impressive, embedding it within broader capability and safety commentary

Evolution: consistent

Milk Road AI (@MilkRoadAI)

Enthusiastically frames the novel method — not just the solved problem — as the watershed element

Evolution: consistent

Tensions

  • Transparency gap: OpenAI presents the result as a clean milestone with no methodological caveats [1], while the broader commentary — from Milk Road AI to Alex Dimakis to the arXiv preprint — treats the novel algebraic number theory method as the central puzzle that remains poorly explained in the public announcement [4][5][2]. [1][4][5][2]
  • General-purpose compute vs. specialized architecture: Rohan Paul argues that test-time compute on a general-purpose LLM is sufficient for frontier mathematical discovery [4], implicitly challenging the premise behind dedicated math-AI systems like DeepMind's Gemini Deep Think [18] — a claim neither confirmed nor denied by OpenAI's sparse announcement [1]. [4][18][1]
  • Evaluator independence: Mathematician Michael Harris has explicitly labeled OpenAI's FrontierMath benchmark involvement a 'scandal' [15], and TechCrunch and TechRepublic have independently documented benchmark discrepancies in OpenAI's math claims [13][16]. These prior episodes do not directly impeach the unit distance result — grounded in an independently verifiable arXiv preprint [2] — but they create a credibility gap that Gil Kalai's enthusiasm [7][8] and mainstream press coverage [10][9] have not yet fully closed. [15][13][16][2][7][8][10][9]

Sources

  1. [1] An OpenAI model has disproved a central conjecture in discrete geometry — OpenAI Blog (2026-05-20)
  2. [2] [2605.20695] Remarks on the disproof of the unit distance conjecture — reactive:openai-erdos-math-breakthrough
  3. [3] [PDF] Remarks on the disproof of the unit distance conjecture - arXiv — reactive:openai-erdos-math-breakthrough
  4. [4] A general-purpose LLM can produce frontier research when given enough test-time compute. — Rohan Paul Twitter (2026-05-21)
  5. [5] This is WILD! — Milk Road AI Twitter (2026-05-21)
  6. [6] RT @AlexGDimakis: A breakthrough by OpenAI in a very famous Combinatorics problem, the Planar Unit Distance problem by E... — reactive:openai-erdos-math-breakthrough (2026-05-22)
  7. [7] An internal model of Open AI disproved Erdos unit distance conjecture. — reactive:openai-erdos-math-breakthrough
  8. [8] Unit Distance Problem | Combinatorics and more — reactive:openai-erdos-math-breakthrough
  9. [9] OpenAI makes breakthrough on 80-year-old maths problem — reactive:openai-erdos-math-breakthrough
  10. [10] OpenAI announces AI's biggest math breakthrough yet — reactive:openai-erdos-math-breakthrough
  11. [11] Po-Shen Loh's Post - LinkedIn — reactive:openai-erdos-math-breakthrough
  12. [12] AI #169: New Knowledge — Zvi's AI Roundups (2026-05-21)
  13. [13] OpenAI's o3 AI model scores lower on a benchmark ... - TechCrunch — reactive:openai-erdos-math-breakthrough
  14. [14] OpenAI Secretly Funded Benchmarking Dataset Linked To o3 Model — reactive:openai-erdos-math-breakthrough
  15. [15] The FrontierMath scandal - by Michael Harris — reactive:openai-erdos-math-breakthrough
  16. [16] OpenAI's o3: AI Benchmark Discrepancy Reveals Gaps in ... — reactive:openai-erdos-math-breakthrough
  17. [17] The AI Revolution in Math Has Arrived | Quanta Magazine — reactive:openai-erdos-math-breakthrough
  18. [18] Gemini Deep Think: Redefining the Future of Scientific Research — reactive:openai-erdos-math-breakthrough
  19. [19] UC Irvine, USC receive $2.6 million DARPA grant for AI to drive math breakthroughs – UC Irvine News — reactive:openai-erdos-math-breakthrough
  20. [20] Amazing: Erdős' Unit Distance Problem was Disproved! It was ... — reactive:openai-erdos-math-breakthrough
  21. [21] RT @AlexGDimakis: A breakthrough by OpenAI in a very famous Combinatorics problem, the Planar Unit Distance problem by E... — reactive:openai-erdos-math-breakthrough (2026-05-22)
  22. [22] RT @AlexGDimakis: A breakthrough by OpenAI in a very famous Combinatorics problem, the Planar Unit Distance problem by E... — reactive:openai-erdos-math-breakthrough (2026-05-22)
  23. [23] RT @AlexGDimakis: A breakthrough by OpenAI in a very famous Combinatorics problem, the Planar Unit Distance problem by E... — reactive:openai-erdos-math-breakthrough (2026-05-22)
  24. [24] RT @AlexGDimakis: A breakthrough by OpenAI in a very famous Combinatorics problem, the Planar Unit Distance problem by E... — reactive:openai-erdos-math-breakthrough (2026-05-21)
  25. [25] @OpenAI This feels monumental. A general-purpose reasoning model making a frontier-level math contribution isn’t AGI, bu... — reactive:openai-erdos-math-breakthrough (2026-05-21)
  26. [26] OpenAI's internal model disproves Unit Distance Conjecture of Erdos — reactive:openai-erdos-math-breakthrough
  27. [27] An OpenAI model has disproved a central conjecture in discrete ... — reactive:openai-erdos-math-breakthrough