This episode of Babbage explores the profound transformation underway in the world of mathematics due to artificial intelligence. Once notoriously bad at arithmetic, large language models (LLMs) have rapidly advanced to solve complex mathematical problems, changing mathematicians' perspectives.
Anjani Trevedi, a former mathematician and correspondent for The Economist, highlights that traditional research mathematics focuses on proving fundamental truths through rigorous, aesthetically pleasing proofs, unlike the rote arithmetic taught in school. For centuries, mathematicians have relied on logical deduction, often building upon prior work, creating a body of knowledge that, while robust, is also widely dispersed and can be difficult to navigate. Computers have long been used for large-scale calculations and checking complex configurations, but modern AI represents a new frontier.
Initially, mathematicians were skeptical of AI's capabilities given its struggle with basic math. However, the landscape shifted dramatically in the past year, with LLMs solving long-standing puzzles like some of the Erdős problems and now tackling more complex proofs. Terence Tao, a Fields Medal-winning mathematician, describes AI as a "really bright undergraduate" that, unlike humans, can tirelessly apply hundreds of methods to thousands of problems. While AI often solves the "low-hanging fruit" – problems that are relatively easy but numerous and haven't received human attention – its ability to combine methods from disparate sub-fields of mathematics, a feat rarely achieved by human specialists, offers significant potential for discovering new connections and insights.
A major bottleneck in mathematics is the need for rigorous verification of proofs. AI models, though powerful, still make mistakes, with Tao estimating an error rate of 98%. This necessitates a robust system to ensure trust. The emerging solution is "auto-formalization," a process that automatically translates natural language mathematical proofs into computer code that can be formally verified for correctness. Patrick Shafto of DARPA describes the current body of mathematical knowledge as a "hot mess" of papers and ideas. Auto-formalization aims to create a unified, consistent, and machine-verifiable repository, which would accelerate progress, reduce redundant efforts, and make advanced mathematics more accessible.
A landmark achievement in this space is the formalization of Marina Vyazovska's Fields Medal-winning proof for the 8-dimensional sphere packing problem. An AI system called Gauss by Math Inc. successfully converted her proof, which took her years to develop, into computer-checkable code. Vyazovska notes that while the AI's output can be verbose (e.g., explicitly proving "2+2=4"), this process provides a deeper, formalized understanding of the complex methods used.
The vision for AI in mathematics includes the creation of "mathematical superintelligence" by companies like Harmonic, or GitHub-style platforms (like Google's Defined) where researchers can publish, download, and instantly verify proofs. Mathematicians view AI as a powerful tool to enhance their productivity, explore more ideas, and connect dots across different mathematical domains, ultimately broadening their intellectual aperture.
However, challenges remain. AI models can "hallucinate" or agree too readily with human prompts, potentially undermining originality. There are also critical issues concerning credit and attribution for human-generated work that AI systems ingest and then formalize or build upon. Vyazovska's experience with Math Inc. highlights the need for clear ethical guidelines as AI increasingly engages with mathematicians' work.
Ultimately, the consensus suggests that AI's role will be to handle the "grunt work" and mechanics of mathematical proof and formalization, creating a highly organized and verifiable knowledge base. This, in turn, could free human mathematicians to focus on the truly creative, intuitive, and conceptual aspects of mathematics, pushing the boundaries of inquiry and potentially leading to breakthroughs in other scientific fields. While AI may eventually generate novel proofs autonomously, the question of their utility and who will "consume" and interpret them remains a societal challenge.