A Data-Driven Two-Phase Multi-Split Causal Ensemble Model for Time
Series
- URL: http://arxiv.org/abs/2403.04793v1
- Date: Mon, 4 Mar 2024 14:20:41 GMT
- Title: A Data-Driven Two-Phase Multi-Split Causal Ensemble Model for Time
Series
- Authors: Zhipeng Ma, Marco Kemmerling, Daniel Buschmann, Chrismarie Enslin,
Daniel L\"utticke, Robert H. Schmitt
- Abstract summary: Causal inference is a fundamental research topic for discovering the cause-effect relationships in many disciplines.
Some approaches may only be able to identify linear relationships, while others are applicable for non-linearities.
This publication proposes a novel data-driven multi-split causal ensemble model to combine the strengths of different causality base algorithms.
- Score: 0.8252665500568257
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Causal inference is a fundamental research topic for discovering the
cause-effect relationships in many disciplines. However, not all algorithms are
equally well-suited for a given dataset. For instance, some approaches may only
be able to identify linear relationships, while others are applicable for
non-linearities. Algorithms further vary in their sensitivity to noise and
their ability to infer causal information from coupled vs. non-coupled time
series. Therefore, different algorithms often generate different causal
relationships for the same input. To achieve a more robust causal inference
result, this publication proposes a novel data-driven two-phase multi-split
causal ensemble model to combine the strengths of different causality base
algorithms. In comparison to existing approaches, the proposed ensemble method
reduces the influence of noise through a data partitioning scheme in the first
phase. To achieve this, the data are initially divided into several partitions
and the base algorithms are applied to each partition. Subsequently, Gaussian
mixture models are used to identify the causal relationships derived from the
different partitions that are likely to be valid. In the second phase, the
identified relationships from each base algorithm are then merged based on
three combination rules. The proposed ensemble approach is evaluated using
multiple metrics, among them a newly developed evaluation index for causal
ensemble approaches. We perform experiments using three synthetic datasets with
different volumes and complexity, which are specifically designed to test
causality detection methods under different circumstances while knowing the
ground truth causal relationships. In these experiments, our causality ensemble
outperforms each of its base algorithms. In practical applications, the use of
the proposed method could hence lead to more robust and reliable causality
results.
Related papers
- Causal discovery for time series with constraint-based model and PMIME
measure [0.0]
We present a novel approach for discovering causality in time series data that combines a causal discovery algorithm with an information theoretic-based measure.
We evaluate the performance of our approach on several simulated data sets, showing promising results.
arXiv Detail & Related papers (2023-05-31T09:38:50Z) - Robust Detection of Lead-Lag Relationships in Lagged Multi-Factor Models [61.10851158749843]
Key insights can be obtained by discovering lead-lag relationships inherent in the data.
We develop a clustering-driven methodology for robust detection of lead-lag relationships in lagged multi-factor models.
arXiv Detail & Related papers (2023-05-11T10:30:35Z) - A Survey on Causal Discovery Methods for I.I.D. and Time Series Data [4.57769506869942]
Causal Discovery (CD) algorithms can identify the cause-effect relationships among the variables of a system from related observational data.
We present an extensive discussion on the methods designed to perform causal discovery from both independent and identically distributed (I.I.D.) data and time series data.
arXiv Detail & Related papers (2023-03-27T09:21:41Z) - Learning to Bound Counterfactual Inference in Structural Causal Models
from Observational and Randomised Data [64.96984404868411]
We derive a likelihood characterisation for the overall data that leads us to extend a previous EM-based algorithm.
The new algorithm learns to approximate the (unidentifiability) region of model parameters from such mixed data sources.
It delivers interval approximations to counterfactual results, which collapse to points in the identifiable case.
arXiv Detail & Related papers (2022-12-06T12:42:11Z) - Valid Inference After Causal Discovery [73.87055989355737]
We develop tools for valid post-causal-discovery inference.
We show that a naive combination of causal discovery and subsequent inference algorithms leads to highly inflated miscoverage rates.
arXiv Detail & Related papers (2022-08-11T17:40:45Z) - Estimation of Bivariate Structural Causal Models by Variational Gaussian
Process Regression Under Likelihoods Parametrised by Normalising Flows [74.85071867225533]
Causal mechanisms can be described by structural causal models.
One major drawback of state-of-the-art artificial intelligence is its lack of explainability.
arXiv Detail & Related papers (2021-09-06T14:52:58Z) - Learning Neural Causal Models with Active Interventions [83.44636110899742]
We introduce an active intervention-targeting mechanism which enables a quick identification of the underlying causal structure of the data-generating process.
Our method significantly reduces the required number of interactions compared with random intervention targeting.
We demonstrate superior performance on multiple benchmarks from simulated to real-world data.
arXiv Detail & Related papers (2021-09-06T13:10:37Z) - Algorithms for Solving Nonlinear Binary Optimization Problems in Robust
Causal Inference [2.169755083801688]
We propose greedy algorithms to solve the robust causal inference test instances from observational data with continuous outcomes.
By leveraging the structure of the feasibility formulation, we develop greedy schemes that are efficient in solving robust test problems.
arXiv Detail & Related papers (2020-12-22T16:12:11Z) - ParKCa: Causal Inference with Partially Known Causes [7.1894784995284144]
Our proposed method ParKCA combines the results of several causal inference methods to learn new causes in applications with some known causes and many potential causes.
Our results show that ParKCA can infer more causes than existing methods.
arXiv Detail & Related papers (2020-03-17T21:36:56Z) - A Critical View of the Structural Causal Model [89.43277111586258]
We show that one can identify the cause and the effect without considering their interaction at all.
We propose a new adversarial training method that mimics the disentangled structure of the causal model.
Our multidimensional method outperforms the literature methods on both synthetic and real world datasets.
arXiv Detail & Related papers (2020-02-23T22:52:28Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.