benchmarkOpenAI

OpenAI GPT-5.4 Pro reportedly solves Erdős problem #1196 in 80 minutes, reveals novel mathematical connection

TL;DR

OpenAI's GPT-5.4 Pro model has reportedly solved Erdős open problem #1196 in approximately 80 minutes, with another 30 minutes to format the solution as a LaTeX paper. Mathematician Terence Tao notes the solution reveals a previously undescribed connection between integer anatomy and Markov process theory.

2 min read
0

OpenAI's GPT-5.4 Pro model has reportedly solved Erdős open problem #1196 in approximately 80 minutes, according to discussions in the Erdős Problems forum. The model spent an additional 30 minutes formatting the solution as a LaTeX paper. Formal verification of the solution is currently underway.

Mathematical significance

Mathematician Terence Tao commented in the forum that the solution reveals a previously undescribed connection between the anatomy of integers and Markov process theory. "That would be a meaningful contribution to the anatomy of integers that goes well beyond the solution of this particular Erdos problem," Tao wrote.

Kevin Barreto, who says he will soon join OpenAI's AI for Science team, noted that the Markov chain technique used by GPT-5.4 Pro represented a creative step that human mathematicians had overlooked despite years of work on the problem.

Technical details

No specific details about GPT-5.4 Pro's architecture, parameter count, or training have been disclosed. The model's name suggests it is a variant within the GPT-5 series, though OpenAI has not officially announced this model publicly.

The solution is currently undergoing formal verification—a standard process in mathematical proofs where independent mathematicians check the validity of the work.

Implications for AI capabilities

The case provides evidence for ongoing debates about whether large language models can discover genuinely new knowledge beyond recombining training data. According to the source, the solution demonstrates that "new, previously undescribed knowledge can also be hidden within already known data points."

The Markov chain approach, while theoretically accessible to human mathematicians working with existing mathematical knowledge, was not identified by researchers who had worked on the problem for years.

What this means

If verified, this represents a concrete example of an AI system producing novel mathematical insight—not merely solving problems with known solution methods, but identifying new theoretical connections. The 80-minute solve time suggests the model engaged in extended reasoning or search processes, though the exact computational approach remains undisclosed. The significance lies less in automating mathematical proof and more in the model's ability to identify non-obvious connections between disparate mathematical domains that human experts had not recognized.

Related Articles

model release

OpenAI releases GPT-Rosalind, biology-focused LLM trained on 50 common research workflows

OpenAI has released GPT-Rosalind, a large language model trained specifically on 50 common biology workflows and major biological databases. Unlike broader science-focused models from competitors, GPT-Rosalind targets specialized biology tasks including pathway analysis, drug target prioritization, and cross-disciplinary research navigation.

model release

OpenAI releases GPT-5.4-Cyber with tiered access verification system for cybersecurity work

OpenAI released GPT-5.4-Cyber, a model variant designed for defensive cybersecurity tasks with fewer restrictions on dual-use queries. Access is controlled through a tiered verification system in the Trusted Access for Cyber program, targeting thousands of vetted users compared to Anthropic's 40-organization Mythos Preview rollout.

product update

OpenAI's Codex for Mac now captures screenshots and sends them to cloud servers for processing

OpenAI's Codex desktop app for Mac has added Chronicle, a feature that periodically captures screenshots, sends them to OpenAI's servers for OCR and visual analysis, then stores text summaries as unencrypted Markdown files locally. The feature requires a $100+/month ChatGPT Pro subscription and is unavailable in the EU, UK, and Switzerland.

product update

OpenAI's Codex for Mac adds Chronicle feature using screen captures to enhance AI context

OpenAI released Chronicle for Codex on Mac, a feature that captures screen content to build contextual memories for the AI coding assistant. Available to Pro subscribers as a research preview, Chronicle runs background agents that generate memories from screen captures stored temporarily on device.

Comments

Loading...