Deep Koopman operator framework for causal discovery in nonlinear dynamical systems
- URL: http://arxiv.org/abs/2505.14828v1
- Date: Tue, 20 May 2025 18:43:48 GMT
- Title: Deep Koopman operator framework for causal discovery in nonlinear dynamical systems
- Authors: Juan Nathaniel, Carla Roesch, Jatan Buch, Derek DeSantis, Adam Rupe, Kara Lamb, Pierre Gentine,
- Abstract summary: We use a deep Koopman operator-theoretic formalism to develop a novel causal discovery algorithm, Kausal.<n>kausal aims to identify cause-effect mechanisms for better scientific understanding, explainable decision-making, and more accurate modeling.<n>Our numerical experiments demonstrate Kausal's superior ability in discovering and characterizing causal signals.
- Score: 0.08813517558695427
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: We use a deep Koopman operator-theoretic formalism to develop a novel causal discovery algorithm, Kausal. Causal discovery aims to identify cause-effect mechanisms for better scientific understanding, explainable decision-making, and more accurate modeling. Standard statistical frameworks, such as Granger causality, lack the ability to quantify causal relationships in nonlinear dynamics due to the presence of complex feedback mechanisms, timescale mixing, and nonstationarity. This presents a challenge in studying many real-world systems, such as the Earth's climate. Meanwhile, Koopman operator methods have emerged as a promising tool for approximating nonlinear dynamics in a linear space of observables. In Kausal, we propose to leverage this powerful idea for causal analysis where optimal observables are inferred using deep learning. Causal estimates are then evaluated in a reproducing kernel Hilbert space, and defined as the distance between the marginal dynamics of the effect and the joint dynamics of the cause-effect observables. Our numerical experiments demonstrate Kausal's superior ability in discovering and characterizing causal signals compared to existing approaches of prescribed observables. Lastly, we extend our analysis to observations of El Ni\~no-Southern Oscillation highlighting our algorithm's applicability to real-world phenomena. Our code is available at https://github.com/juannat7/kausal.
Related papers
- When Counterfactual Reasoning Fails: Chaos and Real-World Complexity [1.9223856107206057]
We investigate the limitations of counterfactual reasoning within the framework of Structural Causal Models.<n>We find that realistic assumptions, such as low degrees of model uncertainty or chaotic dynamics, can result in counterintuitive outcomes.<n>This work urges caution when applying counterfactual reasoning in settings characterized by chaos and uncertainty.
arXiv Detail & Related papers (2025-03-31T08:14:51Z) - Since Faithfulness Fails: The Performance Limits of Neural Causal Discovery [1.8635507597668244]
We identify a fundamental limitation: neural networks cannot reliably distinguish between existing and non-existing causal relationships.<n>Our experiments reveal that neural networks, as used in contemporary causal discovery approaches, lack the precision needed to recover ground-truth graphs.
arXiv Detail & Related papers (2025-02-22T03:20:20Z) - Learning Robust State Observers using Neural ODEs (longer version) [1.0094821910320064]
We present a methodology for the design of state observers for nonlinear systems based on Neural ODEs, learning Luenberger-like observers and their nonlinear extension.
For tuneable KKL observers, the relationship between the design of the observer and its trade-off between convergence speed and robustness is analysed and used as a basis for improving the robustness of the learning-based observer in training.
arXiv Detail & Related papers (2022-12-01T20:58:39Z) - Mining Causality from Continuous-time Dynamics Models: An Application to
Tsunami Forecasting [22.434845478979604]
We propose a mechanism for mining causal structures from continuous-time models.
We train models to capture the causal structure by enforcing sparsity in the weights of the input layers of the dynamics models.
We apply our method to a real-world problem, namely tsunami forecasting, where the exact causal-structures are difficult to characterize.
arXiv Detail & Related papers (2022-10-10T18:53:13Z) - Absence of operator growth for average equal-time observables in
charge-conserved sectors of the Sachdev-Ye-Kitaev model [11.353329565587574]
Quantum scrambling plays an important role in understanding thermalization in closed quantum systems.
We show that scrambling is absent for disorder-averaged expectation values of observables.
We develop a cumulant expansion approach to approximate the evolution of equal-time observables.
arXiv Detail & Related papers (2022-10-05T17:47:52Z) - Discovering Latent Causal Variables via Mechanism Sparsity: A New
Principle for Nonlinear ICA [81.4991350761909]
Independent component analysis (ICA) refers to an ensemble of methods which formalize this goal and provide estimation procedure for practical application.
We show that the latent variables can be recovered up to a permutation if one regularizes the latent mechanisms to be sparse.
arXiv Detail & Related papers (2021-07-21T14:22:14Z) - The Causal Neural Connection: Expressiveness, Learnability, and
Inference [125.57815987218756]
An object called structural causal model (SCM) represents a collection of mechanisms and sources of random variation of the system under investigation.
In this paper, we show that the causal hierarchy theorem (Thm. 1, Bareinboim et al., 2020) still holds for neural models.
We introduce a special type of SCM called a neural causal model (NCM), and formalize a new type of inductive bias to encode structural constraints necessary for performing causal inferences.
arXiv Detail & Related papers (2021-07-02T01:55:18Z) - ACRE: Abstract Causal REasoning Beyond Covariation [90.99059920286484]
We introduce the Abstract Causal REasoning dataset for systematic evaluation of current vision systems in causal induction.
Motivated by the stream of research on causal discovery in Blicket experiments, we query a visual reasoning system with the following four types of questions in either an independent scenario or an interventional scenario.
We notice that pure neural models tend towards an associative strategy under their chance-level performance, whereas neuro-symbolic combinations struggle in backward-blocking reasoning.
arXiv Detail & Related papers (2021-03-26T02:42:38Z) - Leveraging Global Parameters for Flow-based Neural Posterior Estimation [90.21090932619695]
Inferring the parameters of a model based on experimental observations is central to the scientific method.
A particularly challenging setting is when the model is strongly indeterminate, i.e., when distinct sets of parameters yield identical observations.
We present a method for cracking such indeterminacy by exploiting additional information conveyed by an auxiliary set of observations sharing global parameters.
arXiv Detail & Related papers (2021-02-12T12:23:13Z) - Neural Dynamic Mode Decomposition for End-to-End Modeling of Nonlinear
Dynamics [49.41640137945938]
We propose a neural dynamic mode decomposition for estimating a lift function based on neural networks.
With our proposed method, the forecast error is backpropagated through the neural networks and the spectral decomposition.
Our experiments demonstrate the effectiveness of our proposed method in terms of eigenvalue estimation and forecast performance.
arXiv Detail & Related papers (2020-12-11T08:34:26Z) - Gradient Starvation: A Learning Proclivity in Neural Networks [97.02382916372594]
Gradient Starvation arises when cross-entropy loss is minimized by capturing only a subset of features relevant for the task.
This work provides a theoretical explanation for the emergence of such feature imbalance in neural networks.
arXiv Detail & Related papers (2020-11-18T18:52:08Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.