Related papers: Mathematical exploration and discovery at scale

URL: http://arxiv.org/abs/2511.02864v1
Date: Mon, 03 Nov 2025 16:04:07 GMT
Title: Mathematical exploration and discovery at scale
Authors: Bogdan Georgiev, Javier Gómez-Serrano, Terence Tao, Adam Zsolt Wagner
Abstract summary: AlphaEvolve is a generic evolutionary coding agent that proposes, tests, and refines algorithmic solutions to scientific and practical problems. To demonstrate its breadth, we considered a list of 67 problems spanning mathematical analysis, combinatorics, geometry, and number theory. In some instances, AlphaEvolve is also able to generalize results for a finite number of input values into a formula valid for all input values.
Score: 1.2341209649260667
License: http://creativecommons.org/licenses/by/4.0/
Abstract: AlphaEvolve is a generic evolutionary coding agent that combines the generative capabilities of LLMs with automated evaluation in an iterative evolutionary framework that proposes, tests, and refines algorithmic solutions to challenging scientific and practical problems. In this paper we showcase AlphaEvolve as a tool for autonomously discovering novel mathematical constructions and advancing our understanding of long-standing open problems. To demonstrate its breadth, we considered a list of 67 problems spanning mathematical analysis, combinatorics, geometry, and number theory. The system rediscovered the best known solutions in most of the cases and discovered improved solutions in several. In some instances, AlphaEvolve is also able to generalize results for a finite number of input values into a formula valid for all input values. Furthermore, we are able to combine this methodology with Deep Think and AlphaProof in a broader framework where the additional proof-assistants and reasoning systems provide automated proof generation and further mathematical insights. These results demonstrate that large language model-guided evolutionary search can autonomously discover mathematical constructions that complement human intuition, at times matching or even improving the best known results, highlighting the potential for significant new ways of interaction between mathematicians and AI systems. We present AlphaEvolve as a powerful new tool for mathematical discovery, capable of exploring vast search spaces to solve complex optimization problems at scale, often with significantly reduced requirements on preparation and computation time.

Related papers

Towards Autonomous Mathematics Research [48.29504087871558]
We introduce Aletheia, a math research agent that iteratively generates, verifies, and revises solutions end-to-end in natural language. Specifically, Aletheia is powered by an advanced version of Gemini Deep Think for challenging reasoning problems. We demonstrate Aletheia from Olympiad problems to PhD-level exercises and most notably, through several distinct milestones in AI-assisted mathematics research.
arXiv Detail & Related papers (2026-02-10T18:50:15Z)
Accelerating Scientific Research with Gemini: Case Studies and Common Techniques [105.15622072347811]
Large language models (LLMs) have opened new avenues for accelerating scientific research. We present a collection of case studies demonstrating how researchers have successfully collaborated with advanced AI models.
arXiv Detail & Related papers (2026-02-03T18:56:17Z)
Even with AI, Bijection Discovery is Still Hard: The Opportunities and Challenges of OpenEvolve for Novel Bijection Construction [7.629457153784809]
Evolutionary program synthesis systems such as AlphaEvolve, OpenEvolve, and ShinkaEvolve offer a new approach to AI-assisted mathematical discovery. These systems utilize teams of large language models (LLMs) to generate candidate solutions to a problem as human-readable code. We describe the results of applying OpenEvolve to three construction problems involving Dyck paths, two of which are known and one of which is open.
arXiv Detail & Related papers (2025-11-26T02:30:17Z)
AI Mathematician as a Partner in Advancing Mathematical Discovery - A Case Study in Homogenization Theory [6.856242640393325]
We investigate how the AI Mathematician (AIM) system can operate as a research partner rather than a mere problem solver. We reveal how human intuition and machine computation can complement one another. The approach leads to a complete and verifiable proof, and more broadly, demonstrates how systematic human-AI co-reasoning can advance the frontier of mathematical discovery.
arXiv Detail & Related papers (2025-10-30T11:22:15Z)
CodeEvolve: An open source evolutionary coding agent for algorithm discovery and optimization [0.6198237241838559]
We introduce CodeEvolve, an open-source evolutionary coding agent that unites Large Language Models with genetic algorithms to solve complex computational problems. Our framework adapts powerful evolutionary concepts to the Large Language Models domain, building upon recent methods for generalized scientific discovery. We conduct a rigorous evaluation of CodeEvolve on a subset of the mathematical benchmarks used to evaluate Google DeepMind's closed-source AlphaEvolve.
arXiv Detail & Related papers (2025-10-15T22:58:06Z)
Barbarians at the Gate: How AI is Upending Systems Research [58.95406995634148]
We argue that systems research, long focused on designing and evaluating new performance-oriented algorithms, is particularly well-suited for AI-driven solution discovery. We term this approach AI-Driven Research for Systems (ADRS), which iteratively generates, evaluates, and refines solutions. Our results highlight both the disruptive potential and the urgent need to adapt systems research practices in the age of AI.
arXiv Detail & Related papers (2025-10-07T17:49:24Z)
OMEGA: Can LLMs Reason Outside the Box in Math? Evaluating Exploratory, Compositional, and Transformative Generalization [88.76091817642963]
Recent large-scale language models (LLMs) with long Chain-of-Thought reasoning, such as DeepSeek-R1, have achieved impressive results on Olympiad-level mathematics. We introduce OMEGA (Out-of-distribution Math Problems Evaluation with 3 Generalization Axes), a benchmark designed to evaluate three axes of out-of-distribution generalization.
arXiv Detail & Related papers (2025-06-23T17:51:40Z)
AlphaEvolve: A coding agent for scientific and algorithmic discovery [63.13852052551106]
We present AlphaEvolve, an evolutionary coding agent that substantially enhances capabilities of state-of-the-art LLMs. AlphaEvolve orchestrates an autonomous pipeline of LLMs, whose task is to improve an algorithm by making direct changes to the code. We demonstrate the broad applicability of this approach by applying it to a number of important computational problems.
arXiv Detail & Related papers (2025-06-16T06:37:18Z)
Evolutionary thoughts: integration of large language models and evolutionary algorithms [2.3633885460047765]
Large Language Models (LLMs) have unveiled remarkable capabilities in understanding and generating both natural language and code. We propose an enhanced evolutionary search strategy that enables a more focused exploration of expansive solution spaces.
arXiv Detail & Related papers (2025-05-09T03:32:18Z)
When Large Language Models Meet Evolutionary Algorithms: Potential Enhancements and Challenges [50.280704114978384]
Pre-trained large language models (LLMs) exhibit powerful capabilities for generating natural text. Evolutionary algorithms (EAs) can discover diverse solutions to complex real-world problems.
arXiv Detail & Related papers (2024-01-19T05:58:30Z)
A Survey of Deep Learning for Mathematical Reasoning [71.88150173381153]
We review the key tasks, datasets, and methods at the intersection of mathematical reasoning and deep learning over the past decade. Recent advances in large-scale neural language models have opened up new benchmarks and opportunities to use deep learning for mathematical reasoning.
arXiv Detail & Related papers (2022-12-20T18:46:16Z)
Creative AI Through Evolutionary Computation: Principles and Examples [16.8615211682877]
Population-based search techniques make it possible to find creative solutions to practical problems in the real world. Evolutionary computation is the likely "next deep learning".
arXiv Detail & Related papers (2020-08-04T19:53:39Z)

This list is automatically generated from the titles and abstracts of the papers on this site.