Generalized Kernel Ridge Regression for Long Term Causal Inference:
Treatment Effects, Dose Responses, and Counterfactual Distributions
- URL: http://arxiv.org/abs/2201.05139v1
- Date: Thu, 13 Jan 2022 18:51:56 GMT
- Title: Generalized Kernel Ridge Regression for Long Term Causal Inference:
Treatment Effects, Dose Responses, and Counterfactual Distributions
- Authors: Rahul Singh
- Abstract summary: I propose estimators of treatment effects, dose responses, and counterfactual distributions.
For long term treatment effects, I prove $\sqrt{n}$ consistency, Gaussian approximation, and semiparametric efficiency.
For long term dose responses, I prove uniform consistency with finite sample rates.
- Score: 6.441975792340023
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: I propose kernel ridge regression estimators for long term causal inference,
where a short term experimental data set containing randomized treatment and
short term surrogates is fused with a long term observational data set
containing short term surrogates and long term outcomes. I propose estimators
of treatment effects, dose responses, and counterfactual distributions with
closed form solutions in terms of kernel matrix operations. I allow covariates,
treatment, and surrogates to be discrete or continuous, and low, high, or
infinite dimensional. For long term treatment effects, I prove $\sqrt{n}$
consistency, Gaussian approximation, and semiparametric efficiency. For long
term dose responses, I prove uniform consistency with finite sample rates. For
long term counterfactual distributions, I prove convergence in distribution.
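The abstract's phrase "closed form solutions in terms of kernel matrix operations" can be illustrated with ordinary kernel ridge regression. The block below is a minimal sketch of that standard building block only, not the paper's long term estimators. The Gaussian kernel, the `n * lam` ridge scaling, and the names `rbf_kernel`, `krr_fit`, `krr_predict`, `lam`, and `lengthscale` are all assumptions made for illustration.

```python
import numpy as np


def rbf_kernel(A, B, lengthscale=1.0):
    """Gaussian (RBF) kernel matrix between rows of A (n, d) and B (m, d)."""
    sq_dists = (
        np.sum(A**2, axis=1)[:, None]
        + np.sum(B**2, axis=1)[None, :]
        - 2.0 * A @ B.T
    )
    # Clip tiny negative values caused by floating point round-off.
    return np.exp(-np.maximum(sq_dists, 0.0) / (2.0 * lengthscale**2))


def krr_fit(X, y, lam=1e-3, lengthscale=1.0):
    """Closed-form ridge coefficients: alpha = (K + n * lam * I)^{-1} y."""
    n = X.shape[0]
    K = rbf_kernel(X, X, lengthscale)
    return np.linalg.solve(K + n * lam * np.eye(n), y)


def krr_predict(X_train, alpha, X_new, lengthscale=1.0):
    """Predictions f(x) = k(x, X_train) @ alpha for each row x of X_new."""
    return rbf_kernel(X_new, X_train, lengthscale) @ alpha


# Hypothetical toy data: a noiseless sine curve on [0, 1].
X = np.linspace(0.0, 1.0, 50)[:, None]
y = np.sin(2.0 * np.pi * X[:, 0])
alpha = krr_fit(X, y, lam=1e-6, lengthscale=0.2)
fitted = krr_predict(X, alpha, X, lengthscale=0.2)
```

The whole fit is a single linear solve against the regularized kernel matrix, which is the sense in which such estimators are closed form. The paper's estimators presumably combine several such regressions across the experimental and observational samples, and that combination is not reproduced here.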
Related papers
- Long-term Causal Inference via Modeling Sequential Latent Confounding [49.64731441006396]
Ghassami et al. propose an approach based on the Conditional Additive Equi-Confounding Bias (CAECB) assumption.
We introduce a novel assumption that extends the CAECB assumption to accommodate temporal short-term outcomes.
Our proposed assumption states a functional relationship between sequential confounding biases across temporal short-term outcomes.
arXiv Detail & Related papers (2025-02-26T09:56:56Z) - Theory on Score-Mismatched Diffusion Models and Zero-Shot Conditional Samplers [49.97755400231656]
We present the first performance guarantee with explicit dimensional dependencies for general score-mismatched diffusion samplers.
We show that score mismatches result in a distributional bias between the target and sampling distributions, proportional to the accumulated mismatch between the target and training distributions.
This result can be directly applied to zero-shot conditional samplers for any conditional model, irrespective of measurement noise.
arXiv Detail & Related papers (2024-10-17T16:42:12Z) - Non-asymptotic bounds for forward processes in denoising diffusions: Ornstein-Uhlenbeck is hard to beat [49.1574468325115]
This paper presents explicit non-asymptotic bounds on the forward diffusion error in total variation (TV).
We parametrise multi-modal data distributions in terms of the distance $R$ to their furthest modes and consider forward diffusions with additive and multiplicative noise.
arXiv Detail & Related papers (2024-08-25T10:28:31Z) - Risk and cross validation in ridge regression with correlated samples [72.59731158970894]
We provide sharp asymptotics for the in- and out-of-sample risks of ridge regression when the data points have arbitrary correlations.
We further extend our analysis to the case where the test point has non-trivial correlations with the training set, a setting often encountered in time series forecasting.
We validate our theory across a variety of high dimensional data.
arXiv Detail & Related papers (2024-08-08T17:27:29Z) - Estimating Long-term Heterogeneous Dose-response Curve: Generalization Bound Leveraging Optimal Transport Weights [23.602196005738676]
Long-term causal effect estimation is a significant but challenging problem in many applications.
Existing methods rely on ideal assumptions to estimate long-term average effects.
arXiv Detail & Related papers (2024-06-27T14:13:46Z) - Relaxed Quantile Regression: Prediction Intervals for Asymmetric Noise [51.87307904567702]
Quantile regression is a leading approach for obtaining such intervals via the empirical estimation of quantiles in the distribution of outputs.
We propose Relaxed Quantile Regression (RQR), a direct alternative to quantile regression based interval construction that removes this arbitrary constraint.
We demonstrate that this added flexibility results in intervals with an improvement in desirable qualities.
arXiv Detail & Related papers (2024-06-05T13:36:38Z) - A Study of Posterior Stability for Time-Series Latent Diffusion [59.41969496514184]
We first show that posterior collapse will reduce latent diffusion to a variational autoencoder (VAE), making it less expressive.
We then introduce a principled method, the dependency measure, that quantifies the sensitivity of a recurrent decoder to input variables.
Building on our theoretical and empirical studies, we introduce a new framework that extends latent diffusion and has a stable posterior.
arXiv Detail & Related papers (2024-05-22T21:54:12Z) - Simultaneous Inference for Local Structural Parameters with Random Forests [19.014535120129338]
We construct simultaneous confidence intervals for solutions to conditional moment equations.
As a by-product, we obtain several new order-explicit results on the concentration and normal approximation of high-dimensional U-statistics.
arXiv Detail & Related papers (2024-05-13T15:46:11Z) - Long-term Off-Policy Evaluation and Learning [21.047613223586794]
Short- and long-term outcomes of an algorithm often differ, with damaging downstream effects.
It takes months or even longer to observe the long-term outcomes of interest, making the algorithm selection process unacceptably slow.
We propose a new framework called Long-term Off-Policy Evaluation (LOPE), which is based on reward function decomposition.
arXiv Detail & Related papers (2024-04-24T06:59:59Z) - Symmetric Mean-field Langevin Dynamics for Distributional Minimax
Problems [78.96969465641024]
We extend mean-field Langevin dynamics to minimax optimization over probability distributions for the first time with symmetric and provably convergent updates.
We also study time and particle discretization regimes and prove a new uniform-in-time propagation of chaos result.
arXiv Detail & Related papers (2023-12-02T13:01:29Z) - Choosing a Proxy Metric from Past Experiments [54.338884612982405]
In many randomized experiments, the treatment effect of the long-term metric is often difficult or infeasible to measure.
A common alternative is to measure several short-term proxy metrics in the hope they closely track the long-term metric.
We introduce a new statistical framework to both define and construct an optimal proxy metric for use in a homogeneous population of randomized experiments.
arXiv Detail & Related papers (2023-09-14T17:43:02Z) - Ensembled Prediction Intervals for Causal Outcomes Under Hidden
Confounding [49.1865229301561]
We present a simple approach to partial identification using existing causal sensitivity models and show empirically that Caus-Modens gives tighter outcome intervals.
The last of our three diverse benchmarks is a novel usage of GPT-4 for observational experiments with unknown but probeable ground truth.
arXiv Detail & Related papers (2023-06-15T21:42:40Z) - Reconstructing Graph Diffusion History from a Single Snapshot [87.20550495678907]
We propose a novel barycenter formulation for reconstructing Diffusion history from A single SnapsHot (DASH).
We prove that estimation error of diffusion parameters is unavoidable due to NP-hardness of diffusion parameter estimation.
We also develop an effective solver named DIffusion hiTting Times with Optimal proposal (DITTO).
arXiv Detail & Related papers (2023-06-01T09:39:32Z) - Estimating long-term causal effects from short-term experiments and
long-term observational data with unobserved confounding [5.854757988966379]
We study the identification and estimation of long-term treatment effects when both experimental and observational data are available.
Our long-term causal effect estimator is obtained by combining regression residuals with short-term experimental outcomes.
arXiv Detail & Related papers (2023-02-21T12:22:47Z) - Predicting conditional probability distributions of redshifts of Active
Galactic Nuclei using Hierarchical Correlation Reconstruction [0.8702432681310399]
This article applies the Hierarchical Correlation Reconstruction (HCR) approach to inexpensively predict conditional probability distributions.
We get interpretable models, with coefficients describing contributions of features to conditional moments.
This article extends the original approach, especially by using Canonical Correlation Analysis (CCA) for feature optimization and l1 "lasso" regularization.
arXiv Detail & Related papers (2022-06-13T14:28:53Z) - On the Benefits of Large Learning Rates for Kernel Methods [110.03020563291788]
We show that the benefit of large learning rates can be precisely characterized in the context of kernel methods.
We consider the minimization of a quadratic objective in a separable Hilbert space, and show that with early stopping, the choice of learning rate influences the spectral decomposition of the obtained solution.
arXiv Detail & Related papers (2022-02-28T13:01:04Z) - Long-term Causal Inference Under Persistent Confounding via Data Combination [38.026740610259225]
We study the identification and estimation of long-term treatment effects when both experimental and observational data are available.
Since the long-term outcome is observed only after a long delay, it is not measured in the experimental data, but only recorded in the observational data.
arXiv Detail & Related papers (2022-02-15T07:44:20Z) - Generalized Kernel Ridge Regression for Causal Inference with
Missing-at-Random Sample Selection [3.398662563413433]
I propose kernel ridge regression estimators for nonparametric dose response curves and semiparametric treatment effects.
For the discrete treatment case, I prove root-n consistency, Gaussian approximation, and semiparametric efficiency.
arXiv Detail & Related papers (2021-11-09T17:10:49Z) - Sequential Kernel Embedding for Mediated and Time-Varying Dose Response
Curves [26.880628841819004]
We propose simple nonparametric estimators for mediated and time-varying dose response curves based on kernel ridge regression.
Our key innovation is a reproducing kernel Hilbert space technique called sequential kernel embedding.
arXiv Detail & Related papers (2021-11-06T19:51:39Z) - Long-Term Effect Estimation with Surrogate Representation [43.932546958874696]
This work studies the problem of estimating long-term effects, where the outcome of primary interest, or primary outcome, takes months or even years to accumulate.
We propose to build connections between long-term causal inference and sequential models in machine learning.
arXiv Detail & Related papers (2020-08-19T03:16:18Z)
This list is automatically generated from the titles and abstracts of the papers in this site.