Robustness Against Weak or Invalid Instruments: Exploring Nonlinear
Treatment Models with Machine Learning
- URL: http://arxiv.org/abs/2203.12808v4
- Date: Fri, 5 Jan 2024 04:21:55 GMT
- Title: Robustness Against Weak or Invalid Instruments: Exploring Nonlinear
Treatment Models with Machine Learning
- Authors: Zijian Guo and Mengchu Zheng and Peter B\"uhlmann
- Abstract summary: We discuss causal inference for observational studies with possibly invalid instrumental variables.
We propose a novel methodology called two-stage curvature identification (TSCI) by exploring the nonlinear treatment model with machine learning.
- Score: 1.3022753212679383
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We discuss causal inference for observational studies with possibly invalid
instrumental variables. We propose a novel methodology called two-stage
curvature identification (TSCI) by exploring the nonlinear treatment model with
machine learning. {The first-stage machine learning enables improving the
instrumental variable's strength and adjusting for different forms of violating
the instrumental variable assumptions.} The success of TSCI requires the
instrumental variable's effect on treatment to differ from its violation form.
A novel bias correction step is implemented to remove bias resulting from the
potentially high complexity of machine learning. Our proposed \texttt{TSCI}
estimator is shown to be asymptotically unbiased and Gaussian even if the
machine learning algorithm does not consistently estimate the treatment model.
Furthermore, we design a data-dependent method to choose the best among several
candidate violation forms. We apply TSCI to study the effect of education on
earnings.
Related papers
- Unlearning Information Bottleneck: Machine Unlearning of Systematic Patterns and Biases [6.936871609178494]
We present Unlearning Information Bottleneck (UIB), a novel information-theoretic framework designed to enhance the process of machine unlearning.
By proposing a variational upper bound, we recalibrate the model parameters through a dynamic prior that integrates changes in data distribution with an affordable computational cost.
Our experiments across various datasets, models, and unlearning methods demonstrate that our approach effectively removes systematic patterns and biases while maintaining the performance of models post-unlearning.
arXiv Detail & Related papers (2024-05-22T21:54:05Z) - Selective Nonparametric Regression via Testing [54.20569354303575]
We develop an abstention procedure via testing the hypothesis on the value of the conditional variance at a given point.
Unlike existing methods, the proposed one allows to account not only for the value of the variance itself but also for the uncertainty of the corresponding variance predictor.
arXiv Detail & Related papers (2023-09-28T13:04:11Z) - TSCI: two stage curvature identification for causal inference with
invalid instruments [0.0]
TSCI implements treatment effect estimation from observational data under invalid instruments in the R statistical computing environment.
It does not require the classical instrumental variable identification conditions and is effective even if all instruments are invalid.
arXiv Detail & Related papers (2023-04-02T11:12:53Z) - An Experimental Study of Dimension Reduction Methods on Machine Learning
Algorithms with Applications to Psychometrics [77.34726150561087]
We show that dimension reduction can decrease, increase, or provide the same accuracy as no reduction of variables.
Our tentative results find that dimension reduction tends to lead to better performance when used for classification tasks.
arXiv Detail & Related papers (2022-10-19T22:07:13Z) - MIRACLE: Causally-Aware Imputation via Learning Missing Data Mechanisms [82.90843777097606]
We propose a causally-aware imputation algorithm (MIRACLE) for missing data.
MIRACLE iteratively refines the imputation of a baseline by simultaneously modeling the missingness generating mechanism.
We conduct extensive experiments on synthetic and a variety of publicly available datasets to show that MIRACLE is able to consistently improve imputation.
arXiv Detail & Related papers (2021-11-04T22:38:18Z) - Using Data Assimilation to Train a Hybrid Forecast System that Combines
Machine-Learning and Knowledge-Based Components [52.77024349608834]
We consider the problem of data-assisted forecasting of chaotic dynamical systems when the available data is noisy partial measurements.
We show that by using partial measurements of the state of the dynamical system, we can train a machine learning model to improve predictions made by an imperfect knowledge-based model.
arXiv Detail & Related papers (2021-02-15T19:56:48Z) - Mostly Harmless Machine Learning: Learning Optimal Instruments in Linear
IV Models [3.7599363231894176]
We offer theoretical results that justify incorporating machine learning in the standard linear instrumental variable setting.
We use machine learning, combined with sample-splitting, to predict the treatment variable from the instrument.
This allows the researcher to extract non-linear co-variation between the treatment and instrument.
arXiv Detail & Related papers (2020-11-12T01:55:11Z) - Double Robust Representation Learning for Counterfactual Prediction [68.78210173955001]
We propose a novel scalable method to learn double-robust representations for counterfactual predictions.
We make robust and efficient counterfactual predictions for both individual and average treatment effects.
The algorithm shows competitive performance with the state-of-the-art on real world and synthetic data.
arXiv Detail & Related papers (2020-10-15T16:39:26Z) - Learning Stable Nonparametric Dynamical Systems with Gaussian Process
Regression [9.126353101382607]
We learn a nonparametric Lyapunov function based on Gaussian process regression from data.
We prove that stabilization of the nominal model based on the nonparametric control Lyapunov function does not modify the behavior of the nominal model at training samples.
arXiv Detail & Related papers (2020-06-14T11:17:17Z) - Active Learning for Gaussian Process Considering Uncertainties with
Application to Shape Control of Composite Fuselage [7.358477502214471]
We propose two new active learning algorithms for the Gaussian process with uncertainties.
We show that the proposed approach can incorporate the impact from uncertainties, and realize better prediction performance.
This approach has been applied to improving the predictive modeling for automatic shape control of composite fuselage.
arXiv Detail & Related papers (2020-04-23T02:04:53Z) - Machine learning for causal inference: on the use of cross-fit
estimators [77.34726150561087]
Doubly-robust cross-fit estimators have been proposed to yield better statistical properties.
We conducted a simulation study to assess the performance of several estimators for the average causal effect (ACE)
When used with machine learning, the doubly-robust cross-fit estimators substantially outperformed all of the other estimators in terms of bias, variance, and confidence interval coverage.
arXiv Detail & Related papers (2020-04-21T23:09:55Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.