Identification and Debiased Learning of Causal Effects with General Instrumental Variables
- URL: http://arxiv.org/abs/2510.20404v1
- Date: Thu, 23 Oct 2025 10:10:11 GMT
- Title: Identification and Debiased Learning of Causal Effects with General Instrumental Variables
- Authors: Shuyuan Chen, Peng Zhang, Yifan Cui,
- Abstract summary: We develop a general nonparametric framework for identification and learning with multi-categorical or continuous instrumental variables.<n>We derive consistent, efficientally normal estimators via debiased machine learning.<n>We demonstrate the proposed method by employing simulation studies and analyzing real data from the Job Training Partnership Act program.
- Score: 5.00731378650601
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Instrumental variable methods are fundamental to causal inference when treatment assignment is confounded by unobserved variables. In this article, we develop a general nonparametric framework for identification and learning with multi-categorical or continuous instrumental variables. Specifically, we propose an additive instrumental variable framework to identify mean potential outcomes and the average treatment effect with a weighting function. Leveraging semiparametric theory, we derive efficient influence functions and construct consistent, asymptotically normal estimators via debiased machine learning. Extensions to longitudinal data, dynamic treatment regimes, and multiplicative instrumental variables are further developed. We demonstrate the proposed method by employing simulation studies and analyzing real data from the Job Training Partnership Act program.
Related papers
- Causal Inference for Preprocessed Outcomes with an Application to Functional Connectivity [1.7078619393681336]
We propose a semiparametric framework for causal inference with derived outcomes obtained after intra-subject processing.<n>We specialize the framework to a mediation setting and focus on the natural direct effect.<n>We apply our method to estimate the impact of stimulant medication on brain connectivity in children with autism spectrum disorder.
arXiv Detail & Related papers (2026-02-02T15:48:34Z) - Double Machine Learning of Continuous Treatment Effects with General Instrumental Variables [5.00731378650601]
Estimating causal effects of continuous treatments is a common problem in practice.<n>We propose a novel framework for local identification of dose-response functions using instrumental variables.
arXiv Detail & Related papers (2026-01-04T10:29:53Z) - A Tutorial on Dimensionless Learning: Geometric Interpretation and the Effect of Noise [0.0]
Dimensionless learning is a data-driven framework for discovering dimensionless numbers and scaling laws from experimental measurements.<n>This tutorial introduces the method, explaining how it transforms experimental data into compact physical laws.
arXiv Detail & Related papers (2025-12-12T06:56:24Z) - Meta-Statistical Learning: Supervised Learning of Statistical Inference [59.463430294611626]
This work demonstrates that the tools and principles driving the success of large language models (LLMs) can be repurposed to tackle distribution-level tasks.<n>We propose meta-statistical learning, a framework inspired by multi-instance learning that reformulates statistical inference tasks as supervised learning problems.
arXiv Detail & Related papers (2025-02-17T18:04:39Z) - Model-free Methods for Event History Analysis and Efficient Adjustment (PhD Thesis) [55.2480439325792]
This thesis is a series of independent contributions to statistics unified by a model-free perspective.<n>The first chapter elaborates on how a model-free perspective can be used to formulate flexible methods that leverage prediction techniques from machine learning.<n>The second chapter studies the concept of local independence, which describes whether the evolution of one process is directly influenced by another.
arXiv Detail & Related papers (2025-02-11T19:24:09Z) - Information theory for data-driven model reduction in physics and biology [0.0]
We develop an approach to identify relevant variables, defined as those most predictive of the future.<n>In the limit of high compression, the relevant variables are directly determined by the slowest-decaying eigenfunctions.<n>Our results provide a firm foundation to interpret deep learning tools that automatically identify reduced variables.
arXiv Detail & Related papers (2023-12-11T18:39:05Z) - A Semiparametric Instrumented Difference-in-Differences Approach to
Policy Learning [2.1989182578668243]
We propose a general instrumented difference-in-differences (DiD) approach for learning the optimal treatment policy.
Specifically, we establish identification results using a binary instrumental variable (IV) when the parallel trends assumption fails to hold.
We also construct a Wald estimator, novel inverse probability estimators, and a class of semi efficient and multiply robust estimators.
arXiv Detail & Related papers (2023-10-14T09:38:32Z) - A Mechanistic Interpretation of Arithmetic Reasoning in Language Models
using Causal Mediation Analysis [128.0532113800092]
We present a mechanistic interpretation of Transformer-based LMs on arithmetic questions.
This provides insights into how information related to arithmetic is processed by LMs.
arXiv Detail & Related papers (2023-05-24T11:43:47Z) - Identification and multiply robust estimation in causal mediation analysis across principal strata [7.801213477601286]
We consider assessing causal mediation in the presence of a post-treatment event.
We derive the efficient influence function for each mediation estimand, which motivates a set of multiply robust estimators for inference.
arXiv Detail & Related papers (2023-04-20T00:39:20Z) - Variable Importance Matching for Causal Inference [73.25504313552516]
We describe a general framework called Model-to-Match that achieves these goals.
Model-to-Match uses variable importance measurements to construct a distance metric.
We operationalize the Model-to-Match framework with LASSO.
arXiv Detail & Related papers (2023-02-23T00:43:03Z) - Post Reinforcement Learning Inference [20.521169740409263]
We study estimation and inference using data collected by reinforcement learning algorithms.<n>We propose a weighted generalized method of moments (GMM) that uses adaptive weights to stabilize this variance.<n>Key applications include dynamic treatment effect estimation and dynamic off-policy evaluation.
arXiv Detail & Related papers (2023-02-17T12:53:15Z) - Robustness Against Weak or Invalid Instruments: Exploring Nonlinear
Treatment Models with Machine Learning [1.3022753212679383]
We discuss causal inference for observational studies with possibly invalid instrumental variables.
We propose a novel methodology called two-stage curvature identification (TSCI) by exploring the nonlinear treatment model with machine learning.
arXiv Detail & Related papers (2022-03-24T02:19:24Z) - Counterfactual Representation Learning with Balancing Weights [74.67296491574318]
Key to causal inference with observational data is achieving balance in predictive features associated with each treatment type.
Recent literature has explored representation learning to achieve this goal.
We develop an algorithm for flexible, scalable and accurate estimation of causal effects.
arXiv Detail & Related papers (2020-10-23T19:06:03Z) - Estimating Structural Target Functions using Machine Learning and
Influence Functions [103.47897241856603]
We propose a new framework for statistical machine learning of target functions arising as identifiable functionals from statistical models.
This framework is problem- and model-agnostic and can be used to estimate a broad variety of target parameters of interest in applied statistics.
We put particular focus on so-called coarsening at random/doubly robust problems with partially unobserved information.
arXiv Detail & Related papers (2020-08-14T16:48:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.