GPT-5 Pro Disproves 80-Year-Old Erdős Conjecture via Formal Verification
The model disproved the $n^{1+o(1)}$ upper bound for the unit distance problem, identifying point sets with more than $n^{1+\delta}$ distances (source: OpenAI Index, May 20, 2026). Verification was co

The Pitch
OpenAI has released GPT-5 Pro, which autonomously disproved the 80-year-old Erdős planar unit distance conjecture using cross-disciplinary algebraic number theory. Verified via the Lean language, this result marks a shift from conversational mimicry to formal mathematical discovery. See OpenAI profile
Under the Hood
The model disproved the $n^{1+o(1)}$ upper bound for the unit distance problem, identifying point sets with more than $n^{1+\delta}$ distances (source: OpenAI Index, May 20, 2026). Verification was completed using the Lean formal proof language and a peer-review panel including Noga Alon and Melanie Wood (source: Dataconomy/KuCoin).
In terms of benchmarks, GPT-5 Pro reached 94.6% on AIME 2025 and 88.4% on GPQA Diamond (source: Stark Insider). While these numbers are statistically high, the system remains a black box. OpenAI has not disclosed the total compute budget or the number of failed attempts required to generate the successful proof (source: Reddit r/MachineLearning).
Current pricing for GPT-5 sits at $1.25 per 1M input tokens and $10 per 1M output tokens as of May 2026 (source: Vellum.ai). Critics argue the system acts as a "Stockfish for math," discovering counterexamples through massive search rather than fundamental conceptual understanding (source: HN Comment 4).
There are significant technical gaps in the public documentation. We don't know the exact model architecture—speculated to be a "Strawberry/o3" successor—and the specific "non-trivial tweaks" mentioned by postdocs in the Companion Remarks remain undisclosed (source: HN Comment 1).
OpenAI is also contending with a credibility gap following the April 2026 resignation of VP Kevin Weil. Weil departed after it surfaced that the company had falsely claimed GPT-5 solved 10 Erdős problems in late 2025 (source: AutoGPT/Dataconomy). This history of misrepresentation necessitates a cautious approach to their latest claims.
Marcus's Take
Do not mistake a brute-force search success for general reasoning capabilities in your production environment. While the Erdős proof is a genuine academic milestone, the high output costs and lack of transparency regarding failed attempts suggest GPT-5 is a high-compute specialist. Use it for complex verification where accuracy is non-negotiable, but stick to Claude 4.5 Opus or Claude 4 Sonnet for standard backend orchestration.
Ship clean code,
Marcus.

Marcus Webb - Senior Backend Analyst at UsedBy.ai
Related Articles

Slumber: A Rust-Based Terminal Alternative to Postman
Slumber utilizes the Ratatui framework and a local SQLite backend to provide a configuration-first HTTP client that resides entirely in the terminal (GitHub: LucasPickering/slumber). It targets senior

Actual Intelligence: The Wozniak Counter-Thesis to GPT-5 Ubiquity
Steve Wozniak’s May 2026 graduation speech identifies "Actual Intelligence" as the primary value proposition for new engineers (Business Insider). While models like GPT-5 and Claude 4.5 Opus have beco

Nx Console and the Compromise of 3,800 GitHub Repositories
Nx Console is the official UI for the Nx build system, designed to help 2.2 million developers manage complex monorepos and build pipelines. While it carries a "Verified Publisher" badge on the VS Cod
Stay Ahead of AI Adoption Trends
Get our latest reports and insights delivered to your inbox. No spam, just data.