Extending Structural Causal Models for Use in Autonomous Embodied Systems
- URL: http://arxiv.org/abs/2406.01384v1
- Date: Mon, 3 Jun 2024 14:47:05 GMT
- Title: Extending Structural Causal Models for Use in Autonomous Embodied Systems
- Authors: Rhys Howard, Lars Kunze,
- Abstract summary: We present a case study in which we describe a module-based autonomous driving system comprised of structural causal models (SCMs)
The first of these is SCM contexts, with the remainder being three new variable categories -- two of which are based upon functional programming monads.
We conclude by presenting an example application of the causal capabilities of the autonomous driving system.
- Score: 5.309950889075669
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Much work has been done to develop causal reasoning techniques across a number of domains, however the utilisation of causality within autonomous systems is still in its infancy. Autonomous systems would greatly benefit from the integration of causality through the use of representations such as structural causal models (SCMs). The system would be afforded a higher level of transparency, it would enable post-hoc explanations of outcomes, and assist in the online inference of exogenous variables. These qualities are either directly beneficial to the autonomous system or a valuable step in building public trust and informing regulation. To such an end we present a case study in which we describe a module-based autonomous driving system comprised of SCMs. Approaching this task requires considerations of a number of challenges when dealing with a system of great complexity and size, that must operate for extended periods of time by itself. Here we describe these challenges, and present solutions. The first of these is SCM contexts, with the remainder being three new variable categories -- two of which are based upon functional programming monads. Finally, we conclude by presenting an example application of the causal capabilities of the autonomous driving system. In this example, we aim to attribute culpability between vehicular agents in a hypothetical road collision incident.
Related papers
- Understanding the differences in Foundation Models: Attention, State Space Models, and Recurrent Neural Networks [50.29356570858905]
We introduce the Dynamical Systems Framework (DSF), which allows a principled investigation of all these architectures in a common representation.
We provide principled comparisons between softmax attention and other model classes, discussing the theoretical conditions under which softmax attention can be approximated.
This shows the DSF's potential to guide the systematic development of future more efficient and scalable foundation models.
arXiv Detail & Related papers (2024-05-24T17:19:57Z) - Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art [7.072820266877787]
We discuss the current use cases of foundation models for decision-making tasks.
We argue there is a need to step back and simultaneously design systems that can quantify the certainty of a model's decision.
arXiv Detail & Related papers (2024-03-25T08:11:02Z) - Can Large Language Models Learn Independent Causal Mechanisms? [9.274428418715347]
Large Language Models (LLMs) fall short on the same tasks in uncommon settings or with distribution shifts.
We show that causal models, that learn abstract variables and causal relationships, can demonstrate increased robustness against changes in the distribution.
arXiv Detail & Related papers (2024-02-04T23:04:02Z) - Distribution-consistency Structural Causal Models [6.276417011421679]
We introduce a novel textitdistribution-consistency assumption, and in alignment with it, we propose the Distribution-consistency Structural Causal Models (DiscoSCMs)
To concretely reveal the enhanced model capacity, we introduce a new identifiable causal parameter, textitthe probability of consistency, which holds practical significance within DiscoSCM alone.
arXiv Detail & Related papers (2024-01-29T06:46:15Z) - Targeted Reduction of Causal Models [55.11778726095353]
Causal Representation Learning offers a promising avenue to uncover interpretable causal patterns in simulations.
We introduce Targeted Causal Reduction (TCR), a method for condensing complex intervenable models into a concise set of causal factors.
Its ability to generate interpretable high-level explanations from complex models is demonstrated on toy and mechanical systems.
arXiv Detail & Related papers (2023-11-30T15:46:22Z) - Modular Deep Learning [120.36599591042908]
Transfer learning has recently become the dominant paradigm of machine learning.
It remains unclear how to develop models that specialise towards multiple tasks without incurring negative interference.
Modular deep learning has emerged as a promising solution to these challenges.
arXiv Detail & Related papers (2023-02-22T18:11:25Z) - Leveraging the structure of dynamical systems for data-driven modeling [111.45324708884813]
We consider the impact of the training set and its structure on the quality of the long-term prediction.
We show how an informed design of the training set, based on invariants of the system and the structure of the underlying attractor, significantly improves the resulting models.
arXiv Detail & Related papers (2021-12-15T20:09:20Z) - Disentangling Identifiable Features from Noisy Data with Structured
Nonlinear ICA [4.340954888479091]
We introduce a new general identifiable framework for principled disentanglement referred to as Structured Independent Component Analysis (SNICA)
Our contribution is to extend the identifiability theory of deep generative models for a very broad class of structured models.
We establish the major result that identifiability for this framework holds even in the presence of noise of unknown distribution.
arXiv Detail & Related papers (2021-06-17T15:56:57Z) - Self-consistent theory of mobility edges in quasiperiodic chains [62.997667081978825]
We introduce a self-consistent theory of mobility edges in nearest-neighbour tight-binding chains with quasiperiodic potentials.
mobility edges are generic in quasiperiodic systems which lack the energy-independent self-duality of the commonly studied Aubry-Andr'e-Harper model.
arXiv Detail & Related papers (2020-12-02T19:00:09Z) - S2RMs: Spatially Structured Recurrent Modules [105.0377129434636]
We take a step towards exploiting dynamic structure that are capable of simultaneously exploiting both modular andtemporal structures.
We find our models to be robust to the number of available views and better capable of generalization to novel tasks without additional training.
arXiv Detail & Related papers (2020-07-13T17:44:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.