Symmetry and Complexity in Object-Centric Deep Active Inference Models
- URL: http://arxiv.org/abs/2304.14493v1
- Date: Fri, 14 Apr 2023 10:21:26 GMT
- Title: Symmetry and Complexity in Object-Centric Deep Active Inference Models
- Authors: Stefano Ferraro, Toon Van de Maele, Tim Verbelen, and Bart Dhoedt
- Abstract summary: We show how inherent symmetries of particular objects emerge as symmetries in the latent state space of the generative model learnt under deep active inference.
In particular, we focus on object-centric representations, which are trained from pixels to predict novel object views as the agent moves its viewpoint.
- Score: 4.298360054690217
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Humans perceive and interact with hundreds of objects every day. In doing so,
they need to employ mental models of these objects and often exploit symmetries
in the object's shape and appearance in order to learn generalizable and
transferable skills. Active inference is a first principles approach to
understanding and modeling sentient agents. It states that agents entertain a
generative model of their environment, and learn and act by minimizing an upper
bound on their surprisal, i.e. their Free Energy. The Free Energy decomposes
into an accuracy and complexity term, meaning that agents favor the least
complex model, that can accurately explain their sensory observations. In this
paper, we investigate how inherent symmetries of particular objects also emerge
as symmetries in the latent state space of the generative model learnt under
deep active inference. In particular, we focus on object-centric
representations, which are trained from pixels to predict novel object views as
the agent moves its viewpoint. First, we investigate the relation between model
complexity and symmetry exploitation in the state space. Second, we do a
principal component analysis to demonstrate how the model encodes the principal
axis of symmetry of the object in the latent space. Finally, we also
demonstrate how more symmetrical representations can be exploited for better
generalization in the context of manipulation.
Related papers
- Probing the effects of broken symmetries in machine learning [0.0]
We show that non-symmetric models can learn symmetries from data, and that doing so can even be beneficial for the accuracy of the model.
We focus specifically on physical observables that are likely to be affected -- directly or indirectly -- by symmetry breaking, finding negligible consequences when the model is used in an interpolative, bulk, regime.
arXiv Detail & Related papers (2024-06-25T17:34:09Z) - Binding Dynamics in Rotating Features [72.80071820194273]
We propose an alternative "cosine binding" mechanism, which explicitly computes the alignment between features and adjusts weights accordingly.
This allows us to draw direct connections to self-attention and biological neural processes, and to shed light on the fundamental dynamics for object-centric representations to emerge in Rotating Features.
arXiv Detail & Related papers (2024-02-08T12:31:08Z) - Learning Physical Dynamics with Subequivariant Graph Neural Networks [99.41677381754678]
Graph Neural Networks (GNNs) have become a prevailing tool for learning physical dynamics.
Physical laws abide by symmetry, which is a vital inductive bias accounting for model generalization.
Our model achieves on average over 3% enhancement in contact prediction accuracy across 8 scenarios on Physion and 2X lower rollout MSE on RigidFall.
arXiv Detail & Related papers (2022-10-13T10:00:30Z) - Robust and Controllable Object-Centric Learning through Energy-based
Models [95.68748828339059]
ours is a conceptually simple and general approach to learning object-centric representations through an energy-based model.
We show that ours can be easily integrated into existing architectures and can effectively extract high-quality object-centric representations.
arXiv Detail & Related papers (2022-10-11T15:11:15Z) - Disentangling Shape and Pose for Object-Centric Deep Active Inference
Models [4.298360054690217]
We consider the problem of 3D object representation, and focus on different instances of the ShapeNet dataset.
We propose a model that factorizes object shape, pose and category, while still learning a representation for each factor using a deep neural network.
arXiv Detail & Related papers (2022-09-16T12:53:49Z) - Sparse Relational Reasoning with Object-Centric Representations [78.83747601814669]
We investigate the composability of soft-rules learned by relational neural architectures when operating over object-centric representations.
We find that increasing sparsity, especially on features, improves the performance of some models and leads to simpler relations.
arXiv Detail & Related papers (2022-07-15T14:57:33Z) - Symmetry-Based Representations for Artificial and Biological General
Intelligence [4.39338211982718]
We argue that symmetry transformations are a fundamental principle that can guide our search for what makes a good representation.
symmetries have started to gain prominence in machine learning too, resulting in more data efficient and generalisable algorithms.
First demonstrations of the importance of symmetry transformations for representation learning in the brain are starting to arise in neuroscience.
arXiv Detail & Related papers (2022-03-17T11:18:34Z) - Suspected Object Matters: Rethinking Model's Prediction for One-stage
Visual Grounding [93.82542533426766]
We propose a Suspected Object Transformation mechanism (SOT) to encourage the target object selection among the suspected ones.
SOT can be seamlessly integrated into existing CNN and Transformer-based one-stage visual grounders.
Extensive experiments demonstrate the effectiveness of our proposed method.
arXiv Detail & Related papers (2022-03-10T06:41:07Z) - Visual Grounding of Learned Physical Models [66.04898704928517]
Humans intuitively recognize objects' physical properties and predict their motion, even when the objects are engaged in complicated interactions.
We present a neural model that simultaneously reasons about physics and makes future predictions based on visual and dynamics priors.
Experiments show that our model can infer the physical properties within a few observations, which allows the model to quickly adapt to unseen scenarios and make accurate predictions into the future.
arXiv Detail & Related papers (2020-04-28T17:06:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.