Google DeepMind: Pioneering Mathematical AI with AlphaProof and AlphaGeometry 2 Achieving Breakthrough at International Mathematical Olympiad

Published in

Google Developer Experts

4 min readJul 26, 2024

Introduction

Artificial General Intelligence (AGI) continues to advance, pushing the boundaries of what’s possible in science and technology. At Google DeepMind, significant progress has been made in developing cutting-edge AI systems designed to tackle complex mathematical problems. The latest breakthroughs, AlphaProof and AlphaGeometry 2, have demonstrated remarkable capabilities, achieving a significant milestone at the International Mathematical Olympiad (IMO).

AlphaProof and AlphaGeometry 2 have been developed to assist mathematicians in discovering new insights, algorithms, and solutions to open problems. These systems have successfully solved four out of six problems from this year’s IMO, achieving a performance level equivalent to a silver medalist. This achievement marks a significant step forward in the application of AI to formal mathematical reasoning and problem-solving.

AlphaProof: A Formal Approach to Reasoning

AlphaProof is an advanced system designed to autonomously prove mathematical statements using the formal language Lean. This system integrates a pre-trained language model with the AlphaZero reinforcement learning algorithm, the same algorithm that mastered the complex games of chess, shogi, and Go.

Advantages of Formal Languages

The use of formal languages in mathematical reasoning offers a critical advantage: the ability to formally verify proofs for correctness. This verification ensures that the proofs generated are not only accurate but also reliable. However, the application of formal languages in machine learning has historically been limited by the scarcity of human-written data available for training purposes.

Bridging the Gap with Natural Language

Natural language-based approaches, while having access to vast amounts of data, often generate plausible but incorrect intermediate reasoning steps and solutions. To address this, AlphaProof bridges these complementary spheres by fine-tuning a Gemini model to translate natural language problem statements into formal statements. This translation process has enabled the creation of a large library of formal problems with varying levels of difficulty, significantly enhancing the training dataset for AlphaProof.

AlphaProof’s Problem-Solving Mechanism

When presented with a problem, AlphaProof generates solution candidates and searches for proof steps in Lean to either prove or disprove them. Verified proofs reinforce AlphaProof’s language model, continuously improving its problem-solving capabilities. This iterative process ensures that AlphaProof becomes progressively more adept at tackling increasingly challenging problems.

Training for the International Mathematical Olympiad (IMO)

In preparation for the IMO, AlphaProof underwent intensive training by proving or disproving millions of problems across a broad spectrum of difficulties and mathematical topics. This rigorous training regimen spanned several weeks leading up to the competition. During the contest, the training loop continued, with AlphaProof reinforcing proofs of self-generated variations of the contest problems until complete solutions were achieved.

AlphaGeometry 2: A More Competitive System

AlphaGeometry 2 is a significantly improved version of its predecessor, AlphaGeometry. It is a neuro-symbolic hybrid system where the language model is based on Gemini and trained from scratch on an order of magnitude more synthetic data than its predecessor. This enhancement enabled the model to tackle much more challenging geometry problems, including those involving the movement of objects and equations related to angles, ratios, or distances.

AlphaGeometry 2 employs a symbolic engine that is two orders of magnitude faster than its predecessor. When presented with a new problem, a novel knowledge-sharing mechanism enables advanced combinations of different search trees to tackle more complex problems.

Before this year’s competition, AlphaGeometry 2 could solve 83% of all historical IMO geometry problems from the past 25 years, compared to the 53% rate achieved by its predecessor. For IMO 2024, AlphaGeometry 2 solved Problem 4 within 19 seconds after receiving its formalization.

Example Problem Solved by AlphaGeometry 2

To illustrate the capabilities of AlphaGeometry 2, consider Problem 4 from the IMO 2024:

Within 19 seconds, AlphaGeometry 2 formalized the problem, applied the necessary geometric principles, and verified the solution, showcasing its enhanced problem-solving capabilities.

Conclusion

The development of AlphaProof and AlphaGeometry 2 represents a significant advancement in the field of Artificial General Intelligence (AGI). By integrating formal languages with reinforcement learning and leveraging vast amounts of training data, these systems have demonstrated unprecedented capabilities in solving complex mathematical problems. The success of AlphaProof and AlphaGeometry 2 at the International Mathematical Olympiad, achieving a performance level equivalent to a silver medalist, highlights the potential of these AI systems to contribute to scientific and technological progress.

These breakthroughs not only showcase the power of combining formal and natural language approaches but also set the stage for future developments in AGI. As these systems continue to evolve, they promise to unlock new frontiers in mathematical reasoning, problem-solving, and beyond, offering valuable tools for researchers and mathematicians worldwide.

Learn more about this groundbreaking work here.