Continual Causal Inference with Incremental Observational Data
- URL: http://arxiv.org/abs/2303.01775v1
- Date: Fri, 3 Mar 2023 08:33:15 GMT
- Title: Continual Causal Inference with Incremental Observational Data
- Authors: Zhixuan Chu, Ruopeng Li, Stephen Rathbun, Sheng Li
- Abstract summary: We propose a Continual Causal Effect Representation Learning method for estimating causal effects with observational data.
Our method achieves the continual causal effect estimation for new data without compromising the estimation capability for original data.
- Score: 8.543321506666636
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The era of big data has witnessed an increasing availability of observational
data from mobile and social networking, online advertising, web mining,
healthcare, education, public policy, marketing campaigns, and so on, which
facilitates the development of causal effect estimation. Although significant
advances have been made to overcome the challenges in the academic area, such
as missing counterfactual outcomes and selection bias, they only focus on
source-specific and stationary observational data, which is unrealistic in most
industrial applications. In this paper, we investigate a new industrial problem
of causal effect estimation from incrementally available observational data and
present three new evaluation criteria accordingly, including extensibility,
adaptability, and accessibility. We propose a Continual Causal Effect
Representation Learning method for estimating causal effects with observational
data, which are incrementally available from non-stationary data distributions.
Instead of having access to all seen observational data, our method only stores
a limited subset of feature representations learned from previous data.
Combining selective and balanced representation learning, feature
representation distillation, and feature transformation, our method achieves
the continual causal effect estimation for new data without compromising the
estimation capability for original data. Extensive experiments demonstrate the
significance of continual causal effect estimation and the effectiveness of our
method.
Related papers
- Explanatory Model Monitoring to Understand the Effects of Feature Shifts on Performance [61.06245197347139]
We propose a novel approach to explain the behavior of a black-box model under feature shifts.
We refer to our method that combines concepts from Optimal Transport and Shapley Values as Explanatory Performance Estimation.
arXiv Detail & Related papers (2024-08-24T18:28:19Z) - On Counterfactual Data Augmentation Under Confounding [30.76982059341284]
Counterfactual data augmentation has emerged as a method to mitigate confounding biases in the training data.
These biases arise due to various observed and unobserved confounding variables in the data generation process.
We show how our simple augmentation method helps existing state-of-the-art methods achieve good results.
arXiv Detail & Related papers (2023-05-29T16:20:23Z) - Continual Causal Effect Estimation: Challenges and Opportunities [11.343298687766579]
A further understanding of cause and effect within observational data is critical across many domains.
The existing methods mainly focus on source-specific and stationary observational data.
In the era of big data, we face new challenges in causal inference with observational data.
arXiv Detail & Related papers (2023-01-03T09:57:50Z) - Striving for data-model efficiency: Identifying data externalities on
group performance [75.17591306911015]
Building trustworthy, effective, and responsible machine learning systems hinges on understanding how differences in training data and modeling decisions interact to impact predictive performance.
We focus on a particular type of data-model inefficiency, in which adding training data from some sources can actually lower performance evaluated on key sub-groups of the population.
Our results indicate that data-efficiency is a key component of both accurate and trustworthy machine learning.
arXiv Detail & Related papers (2022-11-11T16:48:27Z) - Systematic Evaluation of Predictive Fairness [60.0947291284978]
Mitigating bias in training on biased datasets is an important open problem.
We examine the performance of various debiasing methods across multiple tasks.
We find that data conditions have a strong influence on relative model performance.
arXiv Detail & Related papers (2022-10-17T05:40:13Z) - Data-Driven Causal Effect Estimation Based on Graphical Causal
Modelling: A Survey [30.115088044583953]
We review data-driven methods on causal effect estimation using graphical causal modelling.
We identify and discuss the challenges faced by data-driven causal effect estimation.
We hope this review will motivate more researchers to design better data-driven methods.
arXiv Detail & Related papers (2022-08-20T03:25:58Z) - Cycle-Balanced Representation Learning For Counterfactual Inference [42.229586802733806]
We propose a novel framework based on Cycle-Balanced REpresentation learning for counterfactual inference (CBRE)
Specifically, we realize a robust balanced representation for different groups using adversarial training, and meanwhile construct an information loop, such that preserve original data properties cyclically.
Results on three real-world datasets demonstrate that CBRE matches/outperforms the state-of-the-art methods, and it has a great potential to be applied to counterfactual inference.
arXiv Detail & Related papers (2021-10-29T01:15:16Z) - Federated Estimation of Causal Effects from Observational Data [19.657789891394504]
We present a novel framework for causal inference with federated data sources.
We assess and integrate local causal effects from different private data sources without centralizing them.
arXiv Detail & Related papers (2021-05-31T08:06:00Z) - Accurate and Robust Feature Importance Estimation under Distribution
Shifts [49.58991359544005]
PRoFILE is a novel feature importance estimation method.
We show significant improvements over state-of-the-art approaches, both in terms of fidelity and robustness.
arXiv Detail & Related papers (2020-09-30T05:29:01Z) - Provably Efficient Causal Reinforcement Learning with Confounded
Observational Data [135.64775986546505]
We study how to incorporate the dataset (observational data) collected offline, which is often abundantly available in practice, to improve the sample efficiency in the online setting.
We propose the deconfounded optimistic value iteration (DOVI) algorithm, which incorporates the confounded observational data in a provably efficient manner.
arXiv Detail & Related papers (2020-06-22T14:49:33Z) - Enabling Counterfactual Survival Analysis with Balanced Representations [64.17342727357618]
Survival data are frequently encountered across diverse medical applications, i.e., drug development, risk profiling, and clinical trials.
We propose a theoretically grounded unified framework for counterfactual inference applicable to survival outcomes.
arXiv Detail & Related papers (2020-06-14T01:15:00Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.