Matching for causal effects via multimarginal optimal transport
- URL: http://arxiv.org/abs/2112.04398v1
- Date: Wed, 8 Dec 2021 16:45:31 GMT
- Title: Matching for causal effects via multimarginal optimal transport
- Authors: Florian Gunsilius and Yuliang Xu
- Abstract summary: This article introduces a natural optimal matching method based on entropy-regularized multimarginal optimal transport.
It provides interpretable weights of matched individuals that converge at the parametric rate to the optimal weights in the population, can be efficiently implemented via the classical iterative proportional fitting procedure, and can even match several treatment arms simultaneously.
- Score: 0.0
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Matching on covariates is a well-established framework for estimating causal
effects in observational studies. The principal challenge in these settings
stems from the often high-dimensional structure of the problem. Many methods
have been introduced to deal with this challenge, with different advantages and
drawbacks in computational and statistical performance and interpretability.
Moreover, the methodological focus has been on matching two samples in binary
treatment scenarios, but a dedicated method that can optimally balance samples
across multiple treatments has so far been unavailable. This article introduces
a natural optimal matching method based on entropy-regularized multimarginal
optimal transport that possesses many useful properties to address these
challenges. It provides interpretable weights of matched individuals that
converge at the parametric rate to the optimal weights in the population, can
be efficiently implemented via the classical iterative proportional fitting
procedure, and can even match several treatment arms simultaneously. It also
possesses demonstrably excellent finite sample properties.
Related papers
- On Sampling Strategies for Spectral Model Sharding [7.185534285278903]
In this work, we present two sampling strategies for such sharding.
The first produces unbiased estimators of the original weights, while the second aims to minimize the squared approximation error.
We demonstrate that both of these methods can lead to improved performance on various commonly used datasets.
arXiv Detail & Related papers (2024-10-31T16:37:25Z) - Differentiable Pareto-Smoothed Weighting for High-Dimensional Heterogeneous Treatment Effect Estimation [0.6906005491572401]
We develop a numerically robust estimator by weighted representation learning.
Our experimental results show that by effectively correcting the weight values, our proposed method outperforms the existing ones.
arXiv Detail & Related papers (2024-04-26T15:34:04Z) - Optimal Transport for Treatment Effect Estimation [42.50410909962438]
Estimating conditional average treatment effect from observational data is highly challenging due to the existence of treatment selection bias.
Prevalent methods mitigate this issue by aligning distributions of different treatment groups in the latent space.
We propose a principled approach named Entire Space CounterFactual Regression (ESCFR), which is a new take on optimal transport in the context of causality.
arXiv Detail & Related papers (2023-10-27T17:22:45Z) - High-Probability Convergence for Composite and Distributed Stochastic Minimization and Variational Inequalities with Heavy-Tailed Noise [96.80184504268593]
gradient, clipping is one of the key algorithmic ingredients to derive good high-probability guarantees.
Clipping can spoil the convergence of the popular methods for composite and distributed optimization.
arXiv Detail & Related papers (2023-10-03T07:49:17Z) - Comparison of Methods that Combine Multiple Randomized Trials to
Estimate Heterogeneous Treatment Effects [0.1398098625978622]
Leveraging multiple randomized controlled trials allows for the combination of datasets with unconfounded treatment assignment.
This paper discusses several non-parametric approaches for estimating heterogeneous treatment effects using data from multiple trials.
arXiv Detail & Related papers (2023-03-28T20:43:00Z) - TCFimt: Temporal Counterfactual Forecasting from Individual Multiple
Treatment Perspective [50.675845725806724]
We propose a comprehensive framework of temporal counterfactual forecasting from an individual multiple treatment perspective (TCFimt)
TCFimt constructs adversarial tasks in a seq2seq framework to alleviate selection and time-varying bias and designs a contrastive learning-based block to decouple a mixed treatment effect into separated main treatment effects and causal interactions.
The proposed method shows satisfactory performance in predicting future outcomes with specific treatments and in choosing optimal treatment type and timing than state-of-the-art methods.
arXiv Detail & Related papers (2022-12-17T15:01:05Z) - On Inductive Biases for Heterogeneous Treatment Effect Estimation [91.3755431537592]
We investigate how to exploit structural similarities of an individual's potential outcomes (POs) under different treatments.
We compare three end-to-end learning strategies to overcome this problem.
arXiv Detail & Related papers (2021-06-07T16:30:46Z) - Scalable Personalised Item Ranking through Parametric Density Estimation [53.44830012414444]
Learning from implicit feedback is challenging because of the difficult nature of the one-class problem.
Most conventional methods use a pairwise ranking approach and negative samplers to cope with the one-class problem.
We propose a learning-to-rank approach, which achieves convergence speed comparable to the pointwise counterpart.
arXiv Detail & Related papers (2021-05-11T03:38:16Z) - Deconfounding Scores: Feature Representations for Causal Effect
Estimation with Weak Overlap [140.98628848491146]
We introduce deconfounding scores, which induce better overlap without biasing the target of estimation.
We show that deconfounding scores satisfy a zero-covariance condition that is identifiable in observed data.
In particular, we show that this technique could be an attractive alternative to standard regularizations.
arXiv Detail & Related papers (2021-04-12T18:50:11Z) - Robust Optimal Transport with Applications in Generative Modeling and
Domain Adaptation [120.69747175899421]
Optimal Transport (OT) distances such as Wasserstein have been used in several areas such as GANs and domain adaptation.
We propose a computationally-efficient dual form of the robust OT optimization that is amenable to modern deep learning applications.
Our approach can train state-of-the-art GAN models on noisy datasets corrupted with outlier distributions.
arXiv Detail & Related papers (2020-10-12T17:13:40Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.