Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and
Execution
- URL: http://arxiv.org/abs/2103.14230v1
- Date: Fri, 26 Mar 2021 02:42:18 GMT
- Title: Abstract Spatial-Temporal Reasoning via Probabilistic Abduction and
Execution
- Authors: Chi Zhang, Baoxiong Jia, Song-Chun Zhu, Yixin Zhu
- Abstract summary: Spatial-temporal reasoning is a challenging task in Artificial Intelligence (AI)
Recent works have focused on an abstract reasoning task of this kind -- Raven's Progressive Matrices ( RPM)
We propose a neuro-symbolic Probabilistic Abduction and Execution learner (PrAE) learner.
- Score: 97.50813120600026
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Spatial-temporal reasoning is a challenging task in Artificial Intelligence
(AI) due to its demanding but unique nature: a theoretic requirement on
representing and reasoning based on spatial-temporal knowledge in mind, and an
applied requirement on a high-level cognitive system capable of navigating and
acting in space and time. Recent works have focused on an abstract reasoning
task of this kind -- Raven's Progressive Matrices (RPM). Despite the
encouraging progress on RPM that achieves human-level performance in terms of
accuracy, modern approaches have neither a treatment of human-like reasoning on
generalization, nor a potential to generate answers. To fill in this gap, we
propose a neuro-symbolic Probabilistic Abduction and Execution (PrAE) learner;
central to the PrAE learner is the process of probabilistic abduction and
execution on a probabilistic scene representation, akin to the mental
manipulation of objects. Specifically, we disentangle perception and reasoning
from a monolithic model. The neural visual perception frontend predicts
objects' attributes, later aggregated by a scene inference engine to produce a
probabilistic scene representation. In the symbolic logical reasoning backend,
the PrAE learner uses the representation to abduce the hidden rules. An answer
is predicted by executing the rules on the probabilistic representation. The
entire system is trained end-to-end in an analysis-by-synthesis manner without
any visual attribute annotations. Extensive experiments demonstrate that the
PrAE learner improves cross-configuration generalization and is capable of
rendering an answer, in contrast to prior works that merely make a categorical
choice from candidates.
Related papers
- A Feature-based Generalizable Prediction Model for Both Perceptual and
Abstract Reasoning [1.0650780147044159]
A hallmark of human intelligence is the ability to infer abstract rules from limited experience.
Recent advances in deep learning have led to multiple artificial neural network models matching or even surpassing human performance.
We present an algorithmic approach to rule detection and application using feature detection, affine transformation estimation and search.
arXiv Detail & Related papers (2024-03-08T19:26:30Z) - A challenge in A(G)I, cybernetics revived in the Ouroboros Model as one
algorithm for all thinking [0.0]
The aim of the paper is to highlight strengths and deficiencies of current Artificial Intelligence approaches.
It is proposed to take a wide step back and to newly incorporate aspects of cybernetics and analog control processes.
arXiv Detail & Related papers (2024-03-07T07:39:54Z) - Towards Generative Abstract Reasoning: Completing Raven's Progressive Matrix via Rule Abstraction and Selection [52.107043437362556]
Raven's Progressive Matrix (RPM) is widely used to probe abstract visual reasoning in machine intelligence.
Participators of RPM tests can show powerful reasoning ability by inferring and combining attribute-changing rules.
We propose a deep latent variable model for answer generation problems through Rule AbstractIon and SElection.
arXiv Detail & Related papers (2024-01-18T13:28:44Z) - Stochastic Coherence Over Attention Trajectory For Continuous Learning
In Video Streams [64.82800502603138]
This paper proposes a novel neural-network-based approach to progressively and autonomously develop pixel-wise representations in a video stream.
The proposed method is based on a human-like attention mechanism that allows the agent to learn by observing what is moving in the attended locations.
Our experiments leverage 3D virtual environments and they show that the proposed agents can learn to distinguish objects just by observing the video stream.
arXiv Detail & Related papers (2022-04-26T09:52:31Z) - Learning Theory of Mind via Dynamic Traits Attribution [59.9781556714202]
We propose a new neural ToM architecture that learns to generate a latent trait vector of an actor from the past trajectories.
This trait vector then multiplicatively modulates the prediction mechanism via a fast weights' scheme in the prediction neural network.
We empirically show that the fast weights provide a good inductive bias to model the character traits of agents and hence improves mindreading ability.
arXiv Detail & Related papers (2022-04-17T11:21:18Z) - Visual Abductive Reasoning [85.17040703205608]
Abductive reasoning seeks the likeliest possible explanation for partial observations.
We propose a new task and dataset, Visual Abductive Reasoning ( VAR), for examining abductive reasoning ability of machine intelligence in everyday visual situations.
arXiv Detail & Related papers (2022-03-26T10:17:03Z) - What does it mean to represent? Mental representations as falsifiable
memory patterns [8.430851504111585]
We argue that causal and teleological approaches fail to provide a satisfactory account of representation.
We sketch an alternative according to which representations correspond to inferred latent structures in the world.
These structures are assumed to have certain properties objectively, which allows for planning, prediction, and detection of unexpected events.
arXiv Detail & Related papers (2022-03-06T12:52:42Z) - Learning Algebraic Representation for Systematic Generalization in
Abstract Reasoning [109.21780441933164]
We propose a hybrid approach to improve systematic generalization in reasoning.
We showcase a prototype with algebraic representation for the abstract spatial-temporal task of Raven's Progressive Matrices (RPM)
We show that the algebraic representation learned can be decoded by isomorphism to generate an answer.
arXiv Detail & Related papers (2021-11-25T09:56:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.