OpenAI claims reasoning model disproved 80-year-old Erdős conjecture in geometry
OpenAI claims its new reasoning model has produced an original mathematical proof disproving a geometry conjecture first posed by Paul Erdős in 1946. The company says this is the first time AI has autonomously solved a prominent open problem central to a field of mathematics, with verification from mathematicians including Thomas Bloom and Noga Alon.
OpenAI Claims Reasoning Model Disproved 80-Year-Old Erdős Conjecture
OpenAI says its new general-purpose reasoning model has produced an original mathematical proof disproving a geometry conjecture first posed by mathematician Paul Erdős in 1946.
According to OpenAI, the model discovered "an entirely new family of constructions" that outperform what mathematicians believed were the best possible solutions for nearly 80 years. The company claims this marks "the first time AI has autonomously solved a prominent open problem central to a field of mathematics."
Verification and Context
Unlike OpenAI's previous claim in October 2025—when former VP Kevin Weil incorrectly stated GPT-5 had solved 10 Erdős problems, only to discover the solutions already existed in literature—the company this time published supporting statements from multiple mathematicians.
Verification came from:
- Noga Alon
- Melanie Wood
- Thomas Bloom, who maintains the Erdős Problems website
Bloom, who previously called Weil's October post "a dramatic misrepresentation," stated: "AI is helping us to more fully explore the cathedral of mathematics we have built over the centuries."
Technical Significance
OpenAI emphasizes the proof came from a general-purpose reasoning model, not a system specifically designed for mathematical problems. The company says this demonstrates AI systems can now "hold together long, difficult chains of reasoning and connect ideas across fields in ways researchers may not have previously explored."
The specific Erdős problem, model name, benchmark performance, and technical details of the proof were not disclosed in the announcement.
What This Means
If verified through peer review, this would represent a significant milestone in AI-assisted mathematical research—moving from pattern matching existing solutions to genuine novel discovery. The claim's credibility is strengthened by mathematician endorsements and OpenAI's apparent caution after last year's embarrassment. However, the lack of technical details, model specifications, and peer-reviewed publication leaves key questions unanswered. The broader implication: general reasoning models may now be capable of autonomous discovery in physics, biology, and engineering, not just mathematics.
Related Articles
OpenAI adopts C2PA metadata standard and Google's SynthID watermarking for AI image detection
OpenAI is joining the C2PA open standard and embedding Google DeepMind's invisible SynthID watermark in all AI-generated images from its models. The company is launching a public verification tool that checks for both C2PA metadata and SynthID watermarks, though detection only works for images created by OpenAI's own products.
OpenAI launches ChatGPT financial account integration for Pro users via Plaid partnership
OpenAI is rolling out a preview feature allowing ChatGPT Pro users in the US to connect their bank accounts through Plaid for personalized financial advice. The integration provides access to 12,000+ financial institutions and generates visual dashboards of user finances.
OpenAI brings Codex coding agent to iOS and Android with remote environment monitoring
OpenAI has integrated its Codex coding agent into the ChatGPT mobile app for iOS and Android, allowing developers to monitor live development environments and manage workflows from their phones. The update, announced May 14, 2026, is now available in preview across all ChatGPT plans.
OpenAI adds remote Codex control to ChatGPT mobile apps for iOS and Android
OpenAI has integrated remote Codex control into the ChatGPT mobile apps for iPhone and Android. Users can now approve tasks, review outputs, and manage Codex running on Mac computers, laptops, or remote environments directly from their smartphones.
Comments
Loading...