Variational Causal Dynamics: Discovering Modular World Models from
Interventions
- URL: http://arxiv.org/abs/2206.11131v1
- Date: Wed, 22 Jun 2022 14:28:40 GMT
- Title: Variational Causal Dynamics: Discovering Modular World Models from
Interventions
- Authors: Anson Lei, Bernhard Sch\"olkopf, Ingmar Posner
- Abstract summary: Latent world models allow agents to reason about complex environments with high-dimensional observations.
We present variational causal dynamics (VCD), a structured world model that exploits the invariance of causal mechanisms across environments.
- Score: 25.084146613277973
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Latent world models allow agents to reason about complex environments with
high-dimensional observations. However, adapting to new environments and
effectively leveraging previous knowledge remain significant challenges. We
present variational causal dynamics (VCD), a structured world model that
exploits the invariance of causal mechanisms across environments to achieve
fast and modular adaptation. By causally factorising a transition model, VCD is
able to identify reusable components across different environments. This is
achieved by combining causal discovery and variational inference to learn a
latent representation and transition model jointly in an unsupervised manner.
Specifically, we optimise the evidence lower bound jointly over a
representation model and a transition model structured as a causal graphical
model. In evaluations on simulated environments with state and image
observations, we show that VCD is able to successfully identify causal
variables, and to discover consistent causal structures across different
environments. Moreover, given a small number of observations in a previously
unseen, intervened environment, VCD is able to identify the sparse changes in
the dynamics and to adapt efficiently. In doing so, VCD significantly extends
the capabilities of the current state-of-the-art in latent world models while
also comparing favourably in terms of prediction accuracy.
Related papers
- SPARTAN: A Sparse Transformer Learning Local Causation [63.29645501232935]
Causal structures play a central role in world models that flexibly adapt to changes in the environment.
We present the SPARse TrANsformer World model (SPARTAN), a Transformer-based world model that learns local causal structures between entities in a scene.
By applying sparsity regularisation on the attention pattern between object-factored tokens, SPARTAN identifies sparse local causal models that accurately predict future object states.
arXiv Detail & Related papers (2024-11-11T11:42:48Z) - Identifiable Representation and Model Learning for Latent Dynamic Systems [0.0]
We study the problem of identifiable representation and model learning for latent dynamic systems.
We prove that, for linear or affine nonlinear latent dynamic systems, it is possible to identify the representations up to scaling and determine the models up to some simple transformations.
arXiv Detail & Related papers (2024-10-23T13:55:42Z) - Integrated Dynamic Phenological Feature for Remote Sensing Image Land Cover Change Detection [5.109855690325439]
We introduce the InPhea model, which integrates phenological features into a remote sensing image CD framework.
A constrainer with four constraint modules and a multi-stage contrastive learning approach is employed to aid in the model's understanding of phenological characteristics.
Experiments on the HRSCD, SECD, and PSCD-Wuhan datasets reveal that InPhea outperforms other models.
arXiv Detail & Related papers (2024-08-08T01:07:28Z) - Synthetic location trajectory generation using categorical diffusion
models [50.809683239937584]
Diffusion models (DPMs) have rapidly evolved to be one of the predominant generative models for the simulation of synthetic data.
We propose using DPMs for the generation of synthetic individual location trajectories (ILTs) which are sequences of variables representing physical locations visited by individuals.
arXiv Detail & Related papers (2024-02-19T15:57:39Z) - PGODE: Towards High-quality System Dynamics Modeling [40.76121531452706]
This paper studies the problem of modeling multi-agent dynamical systems, where agents could interact mutually to influence their behaviors.
Recent research predominantly uses geometric graphs to depict these mutual interactions, which are then captured by graph neural networks (GNNs)
We propose a new approach named Prototypical Graph ODE to address the problem.
arXiv Detail & Related papers (2023-11-11T12:04:47Z) - Identifiable Latent Polynomial Causal Models Through the Lens of Change [82.14087963690561]
Causal representation learning aims to unveil latent high-level causal representations from observed low-level data.
One of its primary tasks is to provide reliable assurance of identifying these latent causal models, known as identifiability.
arXiv Detail & Related papers (2023-10-24T07:46:10Z) - On the Interventional Kullback-Leibler Divergence [11.57430292133273]
We introduce the Interventional Kullback-Leibler divergence to quantify both structural and distributional differences between causal models.
We propose a sufficient condition on the intervention targets to identify subsets of observed variables on which the models provably agree or disagree.
arXiv Detail & Related papers (2023-02-10T17:03:29Z) - Style-Hallucinated Dual Consistency Learning: A Unified Framework for
Visual Domain Generalization [113.03189252044773]
We propose a unified framework, Style-HAllucinated Dual consistEncy learning (SHADE), to handle domain shift in various visual tasks.
Our versatile SHADE can significantly enhance the generalization in various visual recognition tasks, including image classification, semantic segmentation and object detection.
arXiv Detail & Related papers (2022-12-18T11:42:51Z) - Towards Robust and Adaptive Motion Forecasting: A Causal Representation
Perspective [72.55093886515824]
We introduce a causal formalism of motion forecasting, which casts the problem as a dynamic process with three groups of latent variables.
We devise a modular architecture that factorizes the representations of invariant mechanisms and style confounders to approximate a causal graph.
Experiment results on synthetic and real datasets show that our three proposed components significantly improve the robustness and reusability of the learned motion representations.
arXiv Detail & Related papers (2021-11-29T18:59:09Z) - Causal Discovery in Physical Systems from Videos [123.79211190669821]
Causal discovery is at the core of human cognition.
We consider the task of causal discovery from videos in an end-to-end fashion without supervision on the ground-truth graph structure.
arXiv Detail & Related papers (2020-07-01T17:29:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.