A Hybrid System for Systematic Generalization in Simple Arithmetic
Problems
- URL: http://arxiv.org/abs/2306.17249v1
- Date: Thu, 29 Jun 2023 18:35:41 GMT
- Title: A Hybrid System for Systematic Generalization in Simple Arithmetic
Problems
- Authors: Flavio Petruzzellis, Alberto Testolin, Alessandro Sperduti
- Abstract summary: We propose a hybrid system capable of solving arithmetic problems that require compositional and systematic reasoning over sequences of symbols.
We show that the proposed system can accurately solve nested arithmetical expressions even when trained only on a subset including the simplest cases.
- Score: 70.91780996370326
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Solving symbolic reasoning problems that require compositionality and
systematicity is considered one of the key ingredients of human intelligence.
However, symbolic reasoning is still a great challenge for deep learning
models, which often cannot generalize the reasoning pattern to
out-of-distribution test cases. In this work, we propose a hybrid system
capable of solving arithmetic problems that require compositional and
systematic reasoning over sequences of symbols. The model acquires such a skill
by learning appropriate substitution rules, which are applied iteratively to
the input string until the expression is completely resolved. We show that the
proposed system can accurately solve nested arithmetical expressions even when
trained only on a subset including the simplest cases, significantly
outperforming both a sequence-to-sequence model trained end-to-end and a
state-of-the-art large language model.
Related papers
- Quantifying artificial intelligence through algebraic generalization [19.999962047304596]
Modern AI systems fall short on tests requiring symbolic processing and abstraction.
No comprehensive and theoretically-motivated framework exists to quantify reasoning in AI systems.
Here, we adopt a framework from computational complexity theory to explicitly quantify symbolic generalization.
arXiv Detail & Related papers (2024-11-08T20:08:18Z) - A Neural Rewriting System to Solve Algorithmic Problems [47.129504708849446]
We propose a modular architecture designed to learn a general procedure for solving nested mathematical formulas.
Inspired by rewriting systems, a classic framework in symbolic artificial intelligence, we include in the architecture three specialized and interacting modules.
We benchmark our system against the Neural Data Router, a recent model specialized for systematic generalization, and a state-of-the-art large language model (GPT-4) probed with advanced prompting strategies.
arXiv Detail & Related papers (2024-02-27T10:57:07Z) - Do Deep Neural Networks Capture Compositionality in Arithmetic
Reasoning? [31.692400722222278]
We introduce a skill tree on compositionality in arithmetic symbolic reasoning that defines the hierarchical levels of complexity along with three compositionality dimensions: systematicity, productivity, and substitutivity.
Our experiments revealed that among the three types of composition, the models struggled most with systematicity, performing poorly even with relatively simple compositions.
arXiv Detail & Related papers (2023-02-15T18:59:04Z) - Learning to Reason With Relational Abstractions [65.89553417442049]
We study how to build stronger reasoning capability in language models using the idea of relational abstractions.
We find that models that are supplied with such sequences as prompts can solve tasks with a significantly higher accuracy.
arXiv Detail & Related papers (2022-10-06T00:27:50Z) - End-to-end Algorithm Synthesis with Recurrent Networks: Logical
Extrapolation Without Overthinking [52.05847268235338]
We show how machine learning systems can perform logical extrapolation without overthinking problems.
We propose a recall architecture that keeps an explicit copy of the problem instance in memory so that it cannot be forgotten.
We also employ a progressive training routine that prevents the model from learning behaviors that are specific to number and instead pushes it to learn behaviors that can be repeated indefinitely.
arXiv Detail & Related papers (2022-02-11T18:43:28Z) - Symbolic Brittleness in Sequence Models: on Systematic Generalization in
Symbolic Mathematics [38.62999063710003]
We consider the problem of symbolic mathematical integration, as it requires generalizing systematically beyond the test set.
We develop a methodology for evaluating generalization that takes advantage of the problem domain's structure and access to a verifier.
We demonstrate challenges in achieving robustness, compositionality, and out-of-distribution generalization, through both carefully constructed manual test suites and a genetic algorithm.
arXiv Detail & Related papers (2021-09-28T18:50:15Z) - Relational reasoning and generalization using non-symbolic neural
networks [66.07793171648161]
Previous work suggested that neural networks were not suitable models of human relational reasoning because they could not represent mathematically identity, the most basic form of equality.
We find neural networks are able to learn basic equality (mathematical identity), (2) sequential equality problems (learning ABA-patterned sequences) with only positive training instances, and (3) a complex, hierarchical equality problem with only basic equality training instances.
These results suggest that essential aspects of symbolic reasoning can emerge from data-driven, non-symbolic learning processes.
arXiv Detail & Related papers (2020-06-14T18:25:42Z) - Machine Number Sense: A Dataset of Visual Arithmetic Problems for
Abstract and Relational Reasoning [95.18337034090648]
We propose a dataset, Machine Number Sense (MNS), consisting of visual arithmetic problems automatically generated using a grammar model--And-Or Graph (AOG)
These visual arithmetic problems are in the form of geometric figures.
We benchmark the MNS dataset using four predominant neural network models as baselines in this visual reasoning task.
arXiv Detail & Related papers (2020-04-25T17:14:58Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.