Towards Geometry Problem Solving in the Large Model Era: A Survey
- URL: http://arxiv.org/abs/2506.02690v1
- Date: Tue, 03 Jun 2025 09:42:49 GMT
- Title: Towards Geometry Problem Solving in the Large Model Era: A Survey
- Authors: Yurui Zhao, Xiang Wang, Jiahong Liu, Irwin King, Zhitao Huang,
- Abstract summary: Geometry problem solving (GPS) represents a critical frontier in artificial intelligence.<n>GPS remains challenging due to the dual demands of spatial understanding and rigorous logical reasoning.<n>This survey systematically synthesizes GPS advancements through three core dimensions.
- Score: 38.68730304442357
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Geometry problem solving (GPS) represents a critical frontier in artificial intelligence, with profound applications in education, computer-aided design, and computational graphics. Despite its significance, automating GPS remains challenging due to the dual demands of spatial understanding and rigorous logical reasoning. Recent advances in large models have enabled notable breakthroughs, particularly for SAT-level problems, yet the field remains fragmented across methodologies, benchmarks, and evaluation frameworks. This survey systematically synthesizes GPS advancements through three core dimensions: (1) benchmark construction, (2) textual and diagrammatic parsing, and (3) reasoning paradigms. We further propose a unified analytical paradigm, assess current limitations, and identify emerging opportunities to guide future research toward human-level geometric reasoning, including automated benchmark generation and interpretable neuro-symbolic integration.
Related papers
- A Survey of Deep Learning for Geometry Problem Solving [72.22844763179786]
This paper provides a survey of the applications of deep learning in geometry problem solving.<n>It includes (i) a comprehensive summary of the relevant tasks in geometry problem solving; (ii) a thorough review of related deep learning methods; and (iii) a detailed analysis of evaluation metrics and methods.<n>Our goal is to provide a comprehensive and practical reference of deep learning for geometry problem solving to promote further developments in this field.
arXiv Detail & Related papers (2025-07-16T06:03:08Z) - AutoGPS: Automated Geometry Problem Solving via Multimodal Formalization and Deductive Reasoning [14.44742282076576]
AutoGPS is a neuro-symbolic collaborative framework that solves geometry problems with concise, reliable, and human-interpretable reasoning processes.<n>The MPF utilizes neural cross-modal comprehension to translate geometry problems into structured formal language representations.<n>The DSR takes the formalization as input and formulates geometry problem solving as a hypergraph expansion task.
arXiv Detail & Related papers (2025-05-29T12:01:20Z) - GeoGramBench: Benchmarking the Geometric Program Reasoning in Modern LLMs [7.605833826892782]
We present a benchmark of 500 carefully refined problems organized by a tailored three-level taxonomy that considers geometric complexity rather than traditional mathematical reasoning complexity.<n>Our comprehensive evaluation of 17 frontier LLMs reveals consistent and pronounced deficiencies.<n>These results highlight the unique challenges posed by program-driven spatial reasoning and establish GeoGramBench as a valuable resource for advancing research in symbolic-to-spatial geometric reasoning.
arXiv Detail & Related papers (2025-05-23T09:17:07Z) - Challenging the Boundaries of Reasoning: An Olympiad-Level Math Benchmark for Large Language Models [86.45058529521258]
OlymMATH is a novel Olympiad-level mathematical benchmark designed to rigorously test the complex reasoning capabilities of LLMs.<n>OlymMATH features 200 meticulously curated problems, each manually verified and available in parallel English and Chinese versions.
arXiv Detail & Related papers (2025-03-27T11:20:17Z) - Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information [25.13992124041851]
This paper presents Pi-GPS, a novel framework that unleashes the power of diagrammatic information to resolve textual ambiguities.<n>We employ MLLMs to disambiguate text based on the diagrammatic context, while the verifier ensures the rectified output adherence to geometric rules.<n> Empirical results demonstrate that Pi-GPS surpasses state-of-the-art models, achieving a nearly 10% improvement on theorem3K over prior neural-symbolic approaches.
arXiv Detail & Related papers (2025-03-07T16:15:00Z) - Fuse, Reason and Verify: Geometry Problem Solving with Parsed Clauses from Diagram [78.79651421493058]
We propose a neural-symbolic model for plane geometry problem solving (PGPS) with three key steps: modal fusion, reasoning process and knowledge verification.
For reasoning, we design an explicable solution program to describe the geometric reasoning process, and employ a self-limited decoder to generate solution program autoregressively.
We also construct a large-scale geometry problem dataset called PGPS9K, containing fine-grained annotations of textual clauses, solution program and involved knowledge solvers.
arXiv Detail & Related papers (2024-07-10T02:45:22Z) - Learning to Solve Geometry Problems via Simulating Human Dual-Reasoning Process [84.49427910920008]
Geometry Problem Solving (GPS) has attracted much attention in recent years.
It requires a solver to comprehensively understand both text and diagram, master essential geometry knowledge, and appropriately apply it in reasoning.
Existing works follow a paradigm of neural machine translation and only focus on enhancing the capability of encoders, which neglects the essential characteristics of human geometry reasoning.
arXiv Detail & Related papers (2024-05-10T03:53:49Z) - GeoQA: A Geometric Question Answering Benchmark Towards Multimodal
Numerical Reasoning [172.36214872466707]
We focus on solving geometric problems, which requires a comprehensive understanding of textual descriptions, visual diagrams, and theorem knowledge.
We propose a Geometric Question Answering dataset GeoQA, containing 5,010 geometric problems with corresponding annotated programs.
arXiv Detail & Related papers (2021-05-30T12:34:17Z) - Inter-GPS: Interpretable Geometry Problem Solving with Formal Language
and Symbolic Reasoning [123.06420835072225]
We construct a new large-scale benchmark, Geometry3K, consisting of 3,002 geometry problems with dense annotation in formal language.
We propose a novel geometry solving approach with formal language and symbolic reasoning, called Interpretable Geometry Problem solver (Inter-GPS)
Inter-GPS incorporates theorem knowledge as conditional rules and performs symbolic reasoning step by step.
arXiv Detail & Related papers (2021-05-10T07:46:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.