Plane Geometry Diagram Parsing
- URL: http://arxiv.org/abs/2205.09363v1
- Date: Thu, 19 May 2022 07:47:01 GMT
- Title: Plane Geometry Diagram Parsing
- Authors: Ming-Liang Zhang, Fei Yin, Yi-Han Hao, Cheng-Lin Liu
- Abstract summary: We propose a powerful diagram based on deep learning and graph reasoning.
A modified instance segmentation method is proposed to extract geometric primitives.
The graph neural network (GNN) is leveraged to realize relation parsing and primitive classification.
- Score: 29.921409628478152
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Geometry diagram parsing plays a key role in geometry problem solving,
wherein the primitive extraction and relation parsing remain challenging due to
the complex layout and between-primitive relationship. In this paper, we
propose a powerful diagram parser based on deep learning and graph reasoning.
Specifically, a modified instance segmentation method is proposed to extract
geometric primitives, and the graph neural network (GNN) is leveraged to
realize relation parsing and primitive classification incorporating geometric
features and prior knowledge. All the modules are integrated into an end-to-end
model called PGDPNet to perform all the sub-tasks simultaneously. In addition,
we build a new large-scale geometry diagram dataset named PGDP5K with primitive
level annotations. Experiments on PGDP5K and an existing dataset IMP-Geometry3K
show that our model outperforms state-of-the-art methods in four sub-tasks
remarkably. Our code, dataset and appendix material are available at
https://github.com/mingliangzhang2018/PGDP.
Related papers
- Geometric Inductive Biases of Deep Networks: The Role of Data and Architecture [22.225213114532533]
We argue that when training a neural network, the input space curvature remains invariant under transformation determined by its architecture.
We show that in cases where the average geometry is low-rank (such as in a ResNet), the geometry only changes in a subset of the input space.
arXiv Detail & Related papers (2024-10-15T19:46:09Z) - Metric-Semantic Factor Graph Generation based on Graph Neural Networks [0.0]
In indoors, certain spatial constraints, such as the relative positioning of planes, remain consistent despite variations in layout.
This paper explores how these invariant relationships can be captured in a graph SLAM framework by representing high-level concepts like rooms and walls.
Several efforts have tackled this issue with add-hoc solutions for each concept generation and with manually-defined factors.
This paper proposes a novel method for metric-semantic factor graph generation which includes defining a semantic scene graph, integrating geometric information, and learning the interconnecting factors.
arXiv Detail & Related papers (2024-09-18T13:24:44Z) - A Survey of Geometric Graph Neural Networks: Data Structures, Models and
Applications [67.33002207179923]
This paper presents a survey of data structures, models, and applications related to geometric GNNs.
We provide a unified view of existing models from the geometric message passing perspective.
We also summarize the applications as well as the related datasets to facilitate later research for methodology development and experimental evaluation.
arXiv Detail & Related papers (2024-03-01T12:13:04Z) - A Hitchhiker's Guide to Geometric GNNs for 3D Atomic Systems [87.30652640973317]
Recent advances in computational modelling of atomic systems represent them as geometric graphs with atoms embedded as nodes in 3D Euclidean space.
Geometric Graph Neural Networks have emerged as the preferred machine learning architecture powering applications ranging from protein structure prediction to molecular simulations and material generation.
This paper provides a comprehensive and self-contained overview of the field of Geometric GNNs for 3D atomic systems.
arXiv Detail & Related papers (2023-12-12T18:44:19Z) - PGDP5K: A Diagram Parsing Dataset for Plane Geometry Problems [0.0]
We propose a new large-scale geometry diagram dataset named PGDP5K and a novel annotation method.
Our dataset consists of 5000 diagram samples composed of 16 shapes, covering 5 positional relations, 22 symbol types and 6 text types.
Experiments on PGDP5K and IMP-Geometry3K datasets reveal that the state-of-the-art (SOTA) method achieves only 66.07% F1 value.
arXiv Detail & Related papers (2022-05-20T03:41:41Z) - Hermitian Symmetric Spaces for Graph Embeddings [0.0]
We learn continuous representations of graphs in spaces of symmetric matrices over C.
These spaces offer a rich geometry that simultaneously admits hyperbolic and Euclidean subspaces.
The proposed models are able to automatically adapt to very dissimilar arrangements without any apriori estimates of graph features.
arXiv Detail & Related papers (2021-05-11T18:14:52Z) - Learning Spatial Context with Graph Neural Network for Multi-Person Pose
Grouping [71.59494156155309]
Bottom-up approaches for image-based multi-person pose estimation consist of two stages: keypoint detection and grouping.
In this work, we formulate the grouping task as a graph partitioning problem, where we learn the affinity matrix with a Graph Neural Network (GNN)
The learned geometry-based affinity is further fused with appearance-based affinity to achieve robust keypoint association.
arXiv Detail & Related papers (2021-04-06T09:21:14Z) - Self-supervised Geometric Perception [96.89966337518854]
Self-supervised geometric perception is a framework to learn a feature descriptor for correspondence matching without any ground-truth geometric model labels.
We show that SGP achieves state-of-the-art performance that is on-par or superior to the supervised oracles trained using ground-truth labels.
arXiv Detail & Related papers (2021-03-04T15:34:43Z) - Primal-Dual Mesh Convolutional Neural Networks [62.165239866312334]
We propose a primal-dual framework drawn from the graph-neural-network literature to triangle meshes.
Our method takes features for both edges and faces of a 3D mesh as input and dynamically aggregates them.
We provide theoretical insights of our approach using tools from the mesh-simplification literature.
arXiv Detail & Related papers (2020-10-23T14:49:02Z) - Graph Geometry Interaction Learning [41.10468385822182]
We develop a novel Geometry Interaction Learning (GIL) method for graphs, a well-suited and efficient alternative for learning abundant geometric properties in graph.
Our method endows each node the freedom to determine the importance of each geometry space via a flexible dual feature interaction learning and probability assembling mechanism.
Promising experimental results are presented for five benchmark datasets on node classification and link prediction tasks.
arXiv Detail & Related papers (2020-10-23T02:40:28Z) - Geometrically Principled Connections in Graph Neural Networks [66.51286736506658]
We argue geometry should remain the primary driving force behind innovation in the emerging field of geometric deep learning.
We relate graph neural networks to widely successful computer graphics and data approximation models: radial basis functions (RBFs)
We introduce affine skip connections, a novel building block formed by combining a fully connected layer with any graph convolution operator.
arXiv Detail & Related papers (2020-04-06T13:25:46Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.