Neurosymbolic Systems of Perception & Cognition: The Role of Attention
- URL: http://arxiv.org/abs/2112.01603v1
- Date: Thu, 2 Dec 2021 20:53:14 GMT
- Title: Neurosymbolic Systems of Perception & Cognition: The Role of Attention
- Authors: Hugo Latapie, Ozkan Kilic, Kristinn R. Thorisson, Pei Wang, Patrick
Hammer
- Abstract summary: A cognitive architecture aimed at cumulative learning must provide the necessary information and control structures.
We argue that knowledge at any level of abstraction involves what we refer to as neurosymbolic information.
- Score: 6.199795410316599
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: A cognitive architecture aimed at cumulative learning must provide the
necessary information and control structures to allow agents to learn
incrementally and autonomously from their experience. This involves managing an
agent's goals as well as continuously relating sensory information to these in
its perception-cognition information stack. The more varied the environment of
a learning agent is, the more general and flexible must be these mechanisms to
handle a wider variety of relevant patterns, tasks, and goal structures. While
many researchers agree that information at different levels of abstraction
likely differs in its makeup and structure and processing mechanisms, agreement
on the particulars of such differences is not generally shared in the research
community. A binary processing architecture (often referred to as System-1 and
System-2) has been proposed as a model of cognitive processing for low- and
high-level information, respectively. We posit that cognition is not binary in
this way and that knowledge at any level of abstraction involves what we refer
to as neurosymbolic information, meaning that data at both high and low levels
must contain both symbolic and subsymbolic information. Further, we argue that
the main differentiating factor between the processing of high and low levels
of data abstraction can be largely attributed to the nature of the involved
attention mechanisms. We describe the key arguments behind this view and review
relevant evidence from the literature.
Related papers
- A Quantifiable Information-Processing Hierarchy Provides a Necessary Condition for Detecting Agency [1.088537320059347]
Existing definitions tend to rely on top-down descriptions that are difficult to quantify.<n>We propose a bottom-up framework grounded in a system's information-processing order.<n>We illustrate the framework with neurophysiological and computational examples, including thermostats and receptor-like memristors.
arXiv Detail & Related papers (2026-01-07T01:26:20Z) - Cognitive Mirrors: Exploring the Diverse Functional Roles of Attention Heads in LLM Reasoning [54.12174882424842]
Large language models (LLMs) have achieved state-of-the-art performance in a variety of tasks, but remain largely opaque in terms of their internal mechanisms.<n>We propose a novel interpretability framework to systematically analyze the roles and behaviors of attention heads.
arXiv Detail & Related papers (2025-12-03T10:24:34Z) - Causal Structure and Representation Learning with Biomedical Applications [23.11061319442]
We outline a statistical and computational framework for causal structure and representation learning motivated by fundamental biomedical questions.<n>We show how to effectively use observational and perturbational data to perform causal discovery on observed causal variables.
arXiv Detail & Related papers (2025-11-06T20:17:58Z) - How Metacognitive Architectures Remember Their Own Thoughts: A Systematic Review [16.35521789216079]
We review how Computational Metacognitive Architectures (CMAs) model, store, remember and process their metacognitive experiences.
We consider different aspects - ranging from the underlying psychological theories to the content and structure of collected data, to the algorithms used and evaluation results.
arXiv Detail & Related papers (2025-02-28T08:48:41Z) - Identifiable Causal Representation Learning: Unsupervised, Multi-View, and Multi-Environment [10.814585613336778]
Causal representation learning aims to combine the core strengths of machine learning and causality.
This thesis investigates what is possible for CRL without direct supervision, and thus contributes to its theoretical foundations.
arXiv Detail & Related papers (2024-06-19T09:14:40Z) - Disentangling and Integrating Relational and Sensory Information in Transformer Architectures [2.5322020135765464]
We distinguish between two types of information: sensory information about the properties of individual objects, and relational information about the relationships between objects.
We propose an architectural extension of the Transformer framework, featuring two distinct attention mechanisms: sensory attention for directing the flow of sensory information, and a novel relational attention mechanism for directing the flow of relational information.
arXiv Detail & Related papers (2024-05-26T23:52:51Z) - Neural Causal Abstractions [63.21695740637627]
We develop a new family of causal abstractions by clustering variables and their domains.
We show that such abstractions are learnable in practical settings through Neural Causal Models.
Our experiments support the theory and illustrate how to scale causal inferences to high-dimensional settings involving image data.
arXiv Detail & Related papers (2024-01-05T02:00:27Z) - Synergistic information supports modality integration and flexible
learning in neural networks solving multiple tasks [107.8565143456161]
We investigate the information processing strategies adopted by simple artificial neural networks performing a variety of cognitive tasks.
Results show that synergy increases as neural networks learn multiple diverse tasks.
randomly turning off neurons during training through dropout increases network redundancy, corresponding to an increase in robustness.
arXiv Detail & Related papers (2022-10-06T15:36:27Z) - AIGenC: An AI generalisation model via creativity [1.933681537640272]
Inspired by cognitive theories of creativity, this paper introduces a computational model (AIGenC)
It lays down the necessary components to enable artificial agents to learn, use and generate transferable representations.
We discuss the model's capability to yield better out-of-distribution generalisation in artificial agents.
arXiv Detail & Related papers (2022-05-19T17:43:31Z) - Understand me, if you refer to Aspect Knowledge: Knowledge-aware Gated
Recurrent Memory Network [54.735400754548635]
Aspect-level sentiment classification (ASC) aims to predict the fine-grained sentiment polarity towards a given aspect mentioned in a review.
Despite recent advances in ASC, enabling machines to preciously infer aspect sentiments is still challenging.
This paper tackles two challenges in ASC: (1) due to lack of aspect knowledge, aspect representation is inadequate to represent aspect's exact meaning and property information; (2) prior works only capture either local syntactic information or global relational information, thus missing either one of them leads to insufficient syntactic information.
arXiv Detail & Related papers (2021-08-05T03:39:30Z) - Affect Analysis in-the-wild: Valence-Arousal, Expressions, Action Units
and a Unified Framework [83.21732533130846]
The paper focuses on large in-the-wild databases, i.e., Aff-Wild and Aff-Wild2.
It presents the design of two classes of deep neural networks trained with these databases.
A novel multi-task and holistic framework is presented which is able to jointly learn and effectively generalize and perform affect recognition.
arXiv Detail & Related papers (2021-03-29T17:36:20Z) - Learning to Combine Top-Down and Bottom-Up Signals in Recurrent Neural
Networks with Attention over Modules [81.1967157385085]
Robust perception relies on both bottom-up and top-down signals.
We explore deep recurrent neural net architectures in which bottom-up and top-down signals are dynamically combined using attention.
We demonstrate on a variety of benchmarks in language modeling, sequential image classification, video prediction and reinforcement learning that the emphbidirectional information flow can improve results over strong baselines.
arXiv Detail & Related papers (2020-06-30T17:26:19Z) - Neuro-symbolic Architectures for Context Understanding [59.899606495602406]
We propose the use of hybrid AI methodology as a framework for combining the strengths of data-driven and knowledge-driven approaches.
Specifically, we inherit the concept of neuro-symbolism as a way of using knowledge-bases to guide the learning progress of deep neural networks.
arXiv Detail & Related papers (2020-03-09T15:04:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.