News

Google’s DeepMind says its AI can tackle Math Olympiad problems

CALIFORNIA – Google DeepMind, Alphabet’s artificial intelligence (AI) research division, said it has made strides in solving complex maths problems, an area that remains challenging for today’s AI programs.

CALIFORNIA – Google DeepMind, Alphabet’s artificial intelligence (AI) research division, said it has made strides in solving complex maths problems, an area that remains challenging for today’s AI programs. On July 25, Google rolled out AlphaProof, which specialises in maths reasoning, and AlphaGeometry 2, an updated version of a model focused on geometry that the company debuted earlier in 2024. The programs aced four of the six problems featured in the International Mathematical Olympiad or Math Olympiad, an annual competition in which students tackle topics such as algebra and geometry, Google said in a blog post.  In the AI industry, where comparison between offerings is difficult, solving maths problems has become a key proof point. That is because large language models, which are trained on vast amounts of written text, tend to be biased towards linguistic rather than mathematical intelligence. While computers are good at numbers and traditional calculations, word-based maths problems fall outside of these norms and require more sophisticated reasoning skills. While AI tools are becoming more proficient at chatting naturally or producing images, they often struggle with problems that require planning or take multiple steps to solve. But Google and its competitors haven’t given up. The company’s biggest rival, OpenAI, has also been working on new reasoning technology, Bloomberg has reported. AlphaProof evolved from Google AI programs that have excelled at complex strategy games such as chess, Japanese chess and East-Asian board game Go, Google said. A DeepMind program famously beat one of the world’s top Go players in 2016. Large language models have a tendency to hallucinate, or deliver incorrect information in convincing fashion. Google said it sidestepped that challenge by using its AI to translate maths problems into technical statements, or what it called “formal language”. Another issue for AI systems in maths is the lack of available training data, unlike chatbots, which can glean information from vast troves of text online. As Google’s AlphaProof model successfully solves problems, its code is updated, allowing it to tackle ever more difficult challenges, the company said. It also released an improved version of its AlphaGeometry AI model, which it said was able to solve 83 per cent of all historical geometry problems included in the Math Olympiad, spanning the last 25 years. But Google’s researchers also said that AI is far from being able to replace human mathematicians with its problem-solving capabilities. “Even in the fullest ambition of what we’re trying to do, I think we are aiming to provide a system that can prove anything,” said Mr David Silver, Google DeepMind’s vice-president of reinforcement learning. “But that’s not the end of what mathematicians do.” Mr Silver said DeepMind’s AI models are more akin to slide rules or calculators: powerful computational tools that might one day help humans come up with mathematical proofs. But what the AI systems lack is imagination. “Mathematicians pose interesting problems,” he said. BLOOMBERG