DeepMind unveils ‘spectacular’ general-purpose science AI

May 14, 2025

Close-up view of a computer chip inscribed with "AI" installed in a circuit board. DeepMind says that AlphaEvolve has helped to improve the design of AI chips. Credit: Christian Ohde/IMAGO via Alamy

Google DeepMind has used chatbot models to come up with solutions to major problems in mathematics and computer science.

The system, called AlphaEvolve, combines the creativity of a large language model (LLM) with algorithms that can scrutinize the model’s suggestions to filter and improve solutions. It was described in a white paper released by the company on 14 May.

“The paper is quite spectacular,” says Mario Krenn, who leads the Artificial Scientist Lab at the Max Planck Institute for the Science of Light in Erlangen, Germany. “I think AlphaEvolve is the first successful demonstration of new discoveries based on general-purpose LLMs.”

As well as using the system to discover solutions to open maths problems, DeepMind has already applied the artificial intelligence (AI) technique to its own practical challenges, says Pushmeet Kohli, head of science at the firm in London.

AlphaEvolve has helped to improve the design of the company’s next generation of tensor processing units — computing chips developed specially for AI — and has found a way to more efficiently exploit Google’s worldwide computing capacity, saving 0.7% of total resources. “It has had substantial impact,” says Kohli.

General-purpose AI

Most of the successful applications of AI in science so far, including the protein-structure-prediction tool AlphaFold, have involved a learning algorithm that was hand-crafted for its task, says Krenn. But AlphaEvolve is general-purpose, tapping the abilities of LLMs to generate code to solve problems in a wide range of domains.

DeepMind describes AlphaEvolve as an ‘agent’, because it involves using interacting AI models. But it targets a different point in the scientific process from many other ‘agentic’ AI science systems, which have been used to review the literature and suggest hypotheses.

AlphaEvolve is based on the firm’s Gemini family of LLMs. Each task starts with the user inputting a question, criteria for evaluation and a suggested solution, for which the LLM proposes hundreds or thousands of modifications. An ‘evaluator’ algorithm then assesses the modifications against the metrics for a good solution (for example, in the task of assigning Google’s computing jobs, researchers want to waste fewer resources).

On the basis of which solutions are judged to be the best, the LLM suggests fresh ideas and over time the system evolves a population of stronger algorithms, says Matej Balog, an AI scientist at DeepMind who co-led the research. “We explore this diverse set of possibilities of how the problem can be solved,” he says.
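
The propose-evaluate-select loop described above can be illustrated with a toy sketch. This is not DeepMind's implementation: the LLM is replaced here by a random-perturbation function, and the evaluator, the target vector and all names (`propose_modifications`, `evaluate`, `evolve`) are hypothetical stand-ins chosen only to show the shape of the loop.

```python
import random

def propose_modifications(parent, rng, n=20):
    """Stand-in for the LLM (assumption): return n randomly perturbed copies of parent."""
    return [[x + rng.gauss(0, 0.5) for x in parent] for _ in range(n)]

def evaluate(candidate):
    """Hypothetical evaluator: higher is better (negative squared distance to a toy target)."""
    target = [3.0, -1.0, 2.0]
    return -sum((c - t) ** 2 for c, t in zip(candidate, target))

def evolve(initial, generations=50, population_size=5, seed=0):
    """Repeatedly propose variants of the current best solutions and keep the top scorers."""
    rng = random.Random(seed)
    population = [initial]
    for _ in range(generations):
        # Keep the current population among the candidates so the best score never regresses.
        candidates = list(population)
        for parent in population:
            candidates.extend(propose_modifications(parent, rng))
        # Rank every candidate with the evaluator and keep only the strongest.
        candidates.sort(key=evaluate, reverse=True)
        population = candidates[:population_size]
    return population[0]

best = evolve([0.0, 0.0, 0.0])
print(evaluate([0.0, 0.0, 0.0]), evaluate(best))
```

In the real system the candidates are pieces of code rather than lists of numbers, and the evaluator measures something like wasted computing resources; the sketch only mirrors the control flow of proposing, scoring and selecting.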

AlphaEvolve builds on the firm’s FunSearch system, which in 2023 was shown to use a similar evolutionary approach to outdo humans in unsolved problems in maths¹. Compared with FunSearch, AlphaEvolve can handle much larger pieces of code and tackle more complex algorithms across a wide range of scientific domains, says Balog.

DeepMind says that AlphaEvolve has come up with a way to perform a calculation, known as matrix multiplication, that in some cases is faster than the fastest-known method, which was developed by German mathematician Volker Strassen in 1969². Such calculations involve multiplying numbers in grids and are used to train neural networks. Despite being general-purpose, AlphaEvolve outperformed AlphaTensor, an AI tool described by the firm in 2022 and designed specifically for matrix multiplication³.
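
To make concrete what "faster" means for matrix multiplication, here is a minimal sketch of Strassen's classical 1969 scheme for two 2×2 matrices. It is not AlphaEvolve's new method, which the article does not spell out. The schoolbook approach uses 8 scalar multiplications for this case; Strassen's uses 7, at the cost of extra additions. The function name `strassen_2x2` is chosen here for illustration.

```python
def strassen_2x2(A, B):
    """Multiply two 2x2 matrices using 7 scalar multiplications instead of 8."""
    (a11, a12), (a21, a22) = A
    (b11, b12), (b21, b22) = B

    # The seven products of Strassen's scheme.
    m1 = (a11 + a22) * (b11 + b22)
    m2 = (a21 + a22) * b11
    m3 = a11 * (b12 - b22)
    m4 = a22 * (b21 - b11)
    m5 = (a11 + a12) * b22
    m6 = (a21 - a11) * (b11 + b12)
    m7 = (a12 - a22) * (b21 + b22)

    # Recombine the products using only additions and subtractions.
    return [
        [m1 + m4 - m5 + m7, m3 + m5],
        [m2 + m4, m1 - m2 + m3 + m6],
    ]

result = strassen_2x2([[1, 2], [3, 4]], [[5, 6], [7, 8]])
print(result)  # [[19, 22], [43, 50]]
```

Applied recursively to large matrices split into blocks, saving one multiplication per 2×2 step compounds, which is why reducing the multiplication count for small cases matters.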
