DESCN: Deep Entire Space Cross Networks for Individual Treatment Effect
Estimation
- URL: http://arxiv.org/abs/2207.09920v3
- Date: Fri, 20 Oct 2023 03:27:23 GMT
- Title: DESCN: Deep Entire Space Cross Networks for Individual Treatment Effect
Estimation
- Authors: Kailiang Zhong, Fengtong Xiao, Yan Ren, Yaorong Liang, Wenqing Yao,
Xiaofeng Yang, and Ling Cen
- Abstract summary: Causal Inference has wide applications in various areas such as E-commerce and precision medicine.
This paper proposes Deep Entire Space Cross Networks (DESCN) to model treatment effects from an end-to-end perspective.
- Score: 7.060064266376701
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Causal Inference has wide applications in various areas such as E-commerce
and precision medicine, and its performance heavily relies on the accurate
estimation of the Individual Treatment Effect (ITE). Conventionally, ITE is
predicted by modeling the treated and control response functions separately in
their individual sample spaces. However, such an approach usually encounters
two issues in practice, i.e. divergent distribution between treated and control
groups due to treatment bias, and significant sample imbalance of their
population sizes. This paper proposes Deep Entire Space Cross Networks (DESCN)
to model treatment effects from an end-to-end perspective. DESCN captures the
integrated information of the treatment propensity, the response, and the
hidden treatment effect through a cross network in a multi-task learning
manner. Our method jointly learns the treatment and response functions in the
entire sample space to avoid treatment bias and employs an intermediate pseudo
treatment effect prediction network to relieve sample imbalance. Extensive
experiments are conducted on a synthetic dataset and a large-scaled production
dataset from the E-commerce voucher distribution business. The results indicate
that DESCN can successfully enhance the accuracy of ITE estimation and improve
the uplift ranking performance. A sample of the production dataset and the
source code are released to facilitate future research in the community, which
is, to the best of our knowledge, the first large-scale public biased treatment
dataset for causal inference.
Related papers
- Model-Based Inference and Experimental Design for Interference Using Partial Network Data [4.76518127830168]
We present a framework for the estimation and inference of treatment effect adjustments using partial network data.
We illustrate procedures to assign treatments using only partial network data.
We validate our approach using simulated experiments on observed graphs with applications to information diffusion in India and Malawi.
arXiv Detail & Related papers (2024-06-17T17:27:18Z) - Estimation of individual causal effects in network setup for multiple
treatments [4.53340898566495]
We study the problem of estimation of Individual Treatment Effects (ITE) in the context of multiple treatments and observational data.
We employ Graph Convolutional Networks (GCN) to learn a shared representation of the confounders.
Our approach utilizes separate neural networks to infer potential outcomes for each treatment.
arXiv Detail & Related papers (2023-12-18T06:07:45Z) - Assessment of Treatment Effect Estimators for Heavy-Tailed Data [70.72363097550483]
A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance.
We provide a novel cross-validation-like methodology to address this challenge.
We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain.
arXiv Detail & Related papers (2021-12-14T17:53:01Z) - Disentangled Counterfactual Recurrent Networks for Treatment Effect
Inference over Time [71.30985926640659]
We introduce the Disentangled Counterfactual Recurrent Network (DCRN), a sequence-to-sequence architecture that estimates treatment outcomes over time.
With an architecture that is completely inspired by the causal structure of treatment influence over time, we advance forecast accuracy and disease understanding.
We demonstrate that DCRN outperforms current state-of-the-art methods in forecasting treatment responses, on both real and simulated data.
arXiv Detail & Related papers (2021-12-07T16:40:28Z) - Causal Effect Variational Autoencoder with Uniform Treatment [50.895390968371665]
Causal effect variational autoencoder (CEVAE) are trained to predict the outcome given observational treatment data.
Uniform treatment variational autoencoders (UTVAE) are trained with uniform treatment distribution using importance sampling.
arXiv Detail & Related papers (2021-11-16T17:40:57Z) - Causal-BALD: Deep Bayesian Active Learning of Outcomes to Infer
Treatment-Effects from Observational Data [37.15330590319357]
Existing approaches rely on fitting deep models on outcomes observed for treated and control populations.
Deep Bayesian active learning provides a framework for efficient data acquisition by selecting points with high uncertainty.
We introduce causal, Bayesian acquisition functions grounded in information theory that bias data acquisition towards regions with overlapping support.
arXiv Detail & Related papers (2021-11-03T15:11:39Z) - Provably Efficient Causal Reinforcement Learning with Confounded
Observational Data [135.64775986546505]
We study how to incorporate the dataset (observational data) collected offline, which is often abundantly available in practice, to improve the sample efficiency in the online setting.
We propose the deconfounded optimistic value iteration (DOVI) algorithm, which incorporates the confounded observational data in a provably efficient manner.
arXiv Detail & Related papers (2020-06-22T14:49:33Z) - Regret Minimization for Causal Inference on Large Treatment Space [21.957539112375496]
We propose a network architecture and a regularizer that extracts a debiased representation from biased observational data.
Our proposed loss minimizes a classification error of whether or not the action is relatively good for the individual target.
arXiv Detail & Related papers (2020-06-10T02:19:48Z) - DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret [59.81290762273153]
Dynamic treatment regimes (DTRs) are personalized, adaptive, multi-stage treatment plans that adapt treatment decisions to an individual's initial features and to intermediate outcomes and features at each subsequent stage.
We propose a novel algorithm that, by carefully balancing exploration and exploitation, is guaranteed to achieve rate-optimal regret when the transition and reward models are linear.
arXiv Detail & Related papers (2020-05-06T13:03:42Z) - Generalization Bounds and Representation Learning for Estimation of
Potential Outcomes and Causal Effects [61.03579766573421]
We study estimation of individual-level causal effects, such as a single patient's response to alternative medication.
We devise representation learning algorithms that minimize our bound, by regularizing the representation's induced treatment group distance.
We extend these algorithms to simultaneously learn a weighted representation to further reduce treatment group distances.
arXiv Detail & Related papers (2020-01-21T10:16:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.