Synthetic Principal Component Design: Fast Covariate Balancing with
Synthetic Controls
- URL: http://arxiv.org/abs/2211.15241v1
- Date: Mon, 28 Nov 2022 11:45:54 GMT
- Title: Synthetic Principal Component Design: Fast Covariate Balancing with
Synthetic Controls
- Authors: Yiping Lu, Jiajin Li, Lexing Ying, Jose Blanchet
- Abstract summary: We develop a globally convergent and practically efficient optimization algorithm.
We establish the first global optimality guarantee for experiment design when pre-treatment data is sampled from certain data-generating processes.
- Score: 16.449993388646277
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: The optimal design of experiments typically involves solving an NP-hard
combinatorial optimization problem. In this paper, we aim to develop a globally
convergent and practically efficient optimization algorithm. Specifically, we
consider a setting where the pre-treatment outcome data is available and the
synthetic control estimator is invoked. The average treatment effect is
estimated via the difference between the weighted average outcomes of the
treated and control units, where the weights are learned from the observed
data. {Under this setting, we surprisingly observed that the optimal
experimental design problem could be reduced to a so-called \textit{phase
synchronization} problem.} We solve this problem via a normalized variant of
the generalized power method with spectral initialization. On the theoretical
side, we establish the first global optimality guarantee for experiment design
when pre-treatment data is sampled from certain data-generating processes.
Empirically, we conduct extensive experiments to demonstrate the effectiveness
of our method on both the US Bureau of Labor Statistics and the
Abadie-Diemond-Hainmueller California Smoking Data. In terms of the root mean
square error, our algorithm surpasses the random design by a large margin.
Related papers
- Probabilistic Bayesian optimal experimental design using conditional
normalizing flows [2.7689411149700685]
Bayesian optimal experimental design (OED) seeks to conduct the most informative experiment under budget constraints.
We propose a novel joint optimization approach to make the solution of the OED problem efficient, scalable, and robust for practical applications.
We demonstrate the performance proposed method for a practical MRI OED problem that has high-dimensional (320 $times 320) parameters at high image resolution, high-dimensional (640 $times 386) observations, and binary designs to select the most informative observations.
arXiv Detail & Related papers (2024-02-28T13:59:20Z) - Efficient adjustment for complex covariates: Gaining efficiency with
DOPE [56.537164957672715]
We propose a framework that accommodates adjustment for any subset of information expressed by the covariates.
Based on our theoretical results, we propose the Debiased Outcome-adapted Propensity Estorimator (DOPE) for efficient estimation of the average treatment effect (ATE)
Our results show that the DOPE provides an efficient and robust methodology for ATE estimation in various observational settings.
arXiv Detail & Related papers (2024-02-20T13:02:51Z) - Synthetic Design: An Optimization Approach to Experimental Design with
Synthetic Controls [5.3063411515511065]
We investigate the optimal design of experimental studies that have pre-treatment outcome data available.
The average treatment effect is estimated as the difference between the weighted average outcomes of the treated and control units.
We propose several methods for choosing the set of treated units in conjunction with the weights.
arXiv Detail & Related papers (2021-12-01T05:05:26Z) - Outlier-Robust Sparse Estimation via Non-Convex Optimization [73.18654719887205]
We explore the connection between high-dimensional statistics and non-robust optimization in the presence of sparsity constraints.
We develop novel and simple optimization formulations for these problems.
As a corollary, we obtain that any first-order method that efficiently converges to station yields an efficient algorithm for these tasks.
arXiv Detail & Related papers (2021-09-23T17:38:24Z) - Zeroth-Order Hybrid Gradient Descent: Towards A Principled Black-Box
Optimization Framework [100.36569795440889]
This work is on the iteration of zero-th-order (ZO) optimization which does not require first-order information.
We show that with a graceful design in coordinate importance sampling, the proposed ZO optimization method is efficient both in terms of complexity as well as as function query cost.
arXiv Detail & Related papers (2020-12-21T17:29:58Z) - Optimal Bayesian experimental design for subsurface flow problems [77.34726150561087]
We propose a novel approach for development of chaos expansion (PCE) surrogate model for the design utility function.
This novel technique enables the derivation of a reasonable quality response surface for the targeted objective function with a computational budget comparable to several single-point evaluations.
arXiv Detail & Related papers (2020-08-10T09:42:59Z) - Incorporating Expert Prior Knowledge into Experimental Design via
Posterior Sampling [58.56638141701966]
Experimenters can often acquire the knowledge about the location of the global optimum.
It is unknown how to incorporate the expert prior knowledge about the global optimum into Bayesian optimization.
An efficient Bayesian optimization approach has been proposed via posterior sampling on the posterior distribution of the global optimum.
arXiv Detail & Related papers (2020-02-26T01:57:36Z) - Distributed Averaging Methods for Randomized Second Order Optimization [54.51566432934556]
We consider distributed optimization problems where forming the Hessian is computationally challenging and communication is a bottleneck.
We develop unbiased parameter averaging methods for randomized second order optimization that employ sampling and sketching of the Hessian.
We also extend the framework of second order averaging methods to introduce an unbiased distributed optimization framework for heterogeneous computing systems.
arXiv Detail & Related papers (2020-02-16T09:01:18Z) - Optimal Experimental Design for Staggered Rollouts [11.187415608299075]
We study the design and analysis of experiments conducted on a set of units over multiple time periods where the starting time of the treatment may vary by unit.
We propose a new algorithm, the Precision-Guided Adaptive Experiment (PGAE) algorithm, that addresses the challenges at both the design stage and at the stage of estimating treatment effects.
arXiv Detail & Related papers (2019-11-09T19:46:29Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.