Related papers: Learning What Matters: Causal Time Series Modeling for Arctic Sea Ice Prediction

Learning What Matters: Causal Time Series Modeling for Arctic Sea Ice Prediction

URL: http://arxiv.org/abs/2509.09128v1
Date: Thu, 11 Sep 2025 03:54:39 GMT
Title: Learning What Matters: Causal Time Series Modeling for Arctic Sea Ice Prediction
Authors: Emam Hossain, Md Osman Gani,
Abstract summary: We introduce a causality-aware deep learning framework for causal feature selection within a hybrid neural architecture.<n>The proposed method identifies causally influential predictors, prioritizes direct causes of SIE dynamics, reduces unnecessary features, and enhances computational efficiency.<n> Experimental results show that incorporating causal inputs leads to improved prediction accuracy and interpretability across varying lead times.
Score: 2.1141584811533645
License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
Abstract: Conventional machine learning and deep learning models typically rely on correlation-based learning, which often fails to distinguish genuine causal relationships from spurious associations, limiting their robustness, interpretability, and ability to generalize. To overcome these limitations, we introduce a causality-aware deep learning framework that integrates Multivariate Granger Causality (MVGC) and PCMCI+ for causal feature selection within a hybrid neural architecture. Leveraging 43 years (1979-2021) of Arctic Sea Ice Extent (SIE) data and associated ocean-atmospheric variables at daily and monthly resolutions, the proposed method identifies causally influential predictors, prioritizes direct causes of SIE dynamics, reduces unnecessary features, and enhances computational efficiency. Experimental results show that incorporating causal inputs leads to improved prediction accuracy and interpretability across varying lead times. While demonstrated on Arctic SIE forecasting, the framework is broadly applicable to other dynamic, high-dimensional domains, offering a scalable approach that advances both the theoretical foundations and practical performance of causality-informed predictive modeling.

Related papers

Time-Varying Causal Treatment for Quantifying the Causal Effect of Short-Term Variations on Arctic Sea Ice Dynamics [0.16206783799607727]
Quantifying the causal relationship between ice melt and freshwater distribution is critical, as these complex interactions manifest as regional fluctuations in sea surface height.<n>We propose the Knowledge-Guided Variational Autoencoder (KGCM-VAE) to quantify causal mechanisms between sea ice thickness and SSH.<n> Experimental results on both synthetic and real-world Arctic datasets demonstrate that KGCM-VAE superior PEHE compared to state-of-the-art benchmarks.
arXiv Detail & Related papers (2026-01-25T01:44:55Z)
Bridging the Gap Between Bayesian Deep Learning and Ensemble Weather Forecasts [100.26854618129039]
Weather forecasting is fundamentally challenged by the chaotic nature of the atmosphere.<n>Recent advances in Bayesian Deep Learning (BDL) offer a promising but often disconnected alternative.<n>We bridge these paradigms through a unified hybrid BDL framework for ensemble weather forecasting.
arXiv Detail & Related papers (2025-11-18T07:49:52Z)
Correlation to Causation: A Causal Deep Learning Framework for Arctic Sea Ice Prediction [3.868211565468035]
We propose a causality-driven deep learning framework that integrates causal discovery algorithms with a hybrid deep learning architecture.<n>Our approach identifies causally significant factors, prioritizes features with direct influence, reduces feature overhead, and improves computational efficiency.<n> Experiments demonstrate that integrating causal features enhances the deep learning model's predictive accuracy and interpretability across multiple lead times.
arXiv Detail & Related papers (2025-03-03T22:24:14Z)
An AI-powered Bayesian generative modeling approach for causal inference in observational studies [4.624176903641013]
CausalBGM is an AI-powered Bayesian generative modeling approach.<n>It estimates the individual treatment effect (ITE) by learning individual-specific distributions of a low-dimensional latent feature set.
arXiv Detail & Related papers (2025-01-01T06:52:45Z)
A PAC-Bayesian Perspective on the Interpolating Information Criterion [54.548058449535155]
We show how a PAC-Bayes bound is obtained for a general class of models, characterizing factors which influence performance in the interpolating regime. We quantify how the test error for overparameterized models achieving effectively zero training error depends on the quality of the implicit regularization imposed by e.g. the combination of model, parameter-initialization scheme.
arXiv Detail & Related papers (2023-11-13T01:48:08Z)
Interpretable Imitation Learning with Dynamic Causal Relations [65.18456572421702]
We propose to expose captured knowledge in the form of a directed acyclic causal graph. We also design this causal discovery process to be state-dependent, enabling it to model the dynamics in latent causal graphs. The proposed framework is composed of three parts: a dynamic causal discovery module, a causality encoding module, and a prediction module, and is trained in an end-to-end manner.
arXiv Detail & Related papers (2023-09-30T20:59:42Z)
Mining Causality from Continuous-time Dynamics Models: An Application to Tsunami Forecasting [22.434845478979604]
We propose a mechanism for mining causal structures from continuous-time models. We train models to capture the causal structure by enforcing sparsity in the weights of the input layers of the dynamics models. We apply our method to a real-world problem, namely tsunami forecasting, where the exact causal-structures are difficult to characterize.
arXiv Detail & Related papers (2022-10-10T18:53:13Z)
Optimized ensemble deep learning framework for scalable forecasting of dynamics containing extreme events [0.0]
Two machine learning techniques are jointly used to achieve synergistic improvements in model accuracy, stability, scalability, and prompting a new wave of applications in the forecasting of dynamics. The proposed OEDL model based on a best convex combination of feed-forward neural networks, reservoir computing, and long short-term memory can play a key role in advancing predictions of dynamics consisting of extreme events.
arXiv Detail & Related papers (2021-06-09T10:59:41Z)
Counterfactual Maximum Likelihood Estimation for Training Deep Networks [83.44219640437657]
Deep learning models are prone to learning spurious correlations that should not be learned as predictive clues. We propose a causality-based training framework to reduce the spurious correlations caused by observable confounders. We conduct experiments on two real-world tasks: Natural Language Inference (NLI) and Image Captioning.
arXiv Detail & Related papers (2021-06-07T17:47:16Z)
Learning Causal Semantic Representation for Out-of-Distribution Prediction [125.38836464226092]
We propose a Causal Semantic Generative model (CSG) based on a causal reasoning so that the two factors are modeled separately. We show that CSG can identify the semantic factor by fitting training data, and this semantic-identification guarantees the boundedness of OOD generalization error.
arXiv Detail & Related papers (2020-11-03T13:16:05Z)
Double Robust Representation Learning for Counterfactual Prediction [68.78210173955001]
We propose a novel scalable method to learn double-robust representations for counterfactual predictions. We make robust and efficient counterfactual predictions for both individual and average treatment effects. The algorithm shows competitive performance with the state-of-the-art on real world and synthetic data.
arXiv Detail & Related papers (2020-10-15T16:39:26Z)

This list is automatically generated from the titles and abstracts of the papers in this site.