Game-Theoretic Algorithms for Conditional Moment Matching
- URL: http://arxiv.org/abs/2208.09551v1
- Date: Fri, 19 Aug 2022 21:24:55 GMT
- Title: Game-Theoretic Algorithms for Conditional Moment Matching
- Authors: Gokul Swamy and Sanjiban Choudhury and J. Andrew Bagnell and Zhiwei
Steven Wu
- Abstract summary: We derive a general, game-theoretic strategy for satisfying conditional moment restrictions (CMR)
We detail various extensions and how to efficiently solve the game defined by CMR.
- Score: 39.4969161422156
- License: http://creativecommons.org/licenses/by-sa/4.0/
- Abstract: A variety of problems in econometrics and machine learning, including
instrumental variable regression and Bellman residual minimization, can be
formulated as satisfying a set of conditional moment restrictions (CMR). We
derive a general, game-theoretic strategy for satisfying CMR that scales to
nonlinear problems, is amenable to gradient-based optimization, and is able to
account for finite sample uncertainty. We recover the approaches of Dikkala et
al. and Dai et al. as special cases of our general framework before detailing
various extensions and how to efficiently solve the game defined by CMR.
Related papers
- Alternating Minimization Schemes for Computing Rate-Distortion-Perception Functions with $f$-Divergence Perception Constraints [10.564071872770146]
We study the computation of the rate-distortion-perception function (RDPF) for discrete memoryless sources.
We characterize the optimal parametric solutions.
We provide sufficient conditions on the distortion and the perception constraints.
arXiv Detail & Related papers (2024-08-27T12:50:12Z) - Functional Generalized Empirical Likelihood Estimation for Conditional
Moment Restrictions [19.39005034948997]
We propose a new estimation method based on generalized empirical likelihood (GEL)
GEL provides a more general framework and has been shown to enjoy favorable small-sample properties compared to GMM-based estimators.
We provide kernel- and neural network-based implementations of the estimator, which achieve state-of-the-art empirical performance on two conditional moment restriction problems.
arXiv Detail & Related papers (2022-07-11T11:02:52Z) - A unified stochastic approximation framework for learning in games [82.74514886461257]
We develop a flexible approximation framework for analyzing the long-run behavior of learning in games (both continuous and finite)
The proposed analysis template incorporates a wide array of popular learning algorithms, including gradient-based methods, exponential/multiplicative weights for learning in finite games, optimistic and bandit variants of the above, etc.
arXiv Detail & Related papers (2022-06-08T14:30:38Z) - Faster One-Sample Stochastic Conditional Gradient Method for Composite
Convex Minimization [61.26619639722804]
We propose a conditional gradient method (CGM) for minimizing convex finite-sum objectives formed as a sum of smooth and non-smooth terms.
The proposed method, equipped with an average gradient (SAG) estimator, requires only one sample per iteration. Nevertheless, it guarantees fast convergence rates on par with more sophisticated variance reduction techniques.
arXiv Detail & Related papers (2022-02-26T19:10:48Z) - The Geometry of Memoryless Stochastic Policy Optimization in
Infinite-Horizon POMDPs [0.0]
We consider the problem of finding the best memoryless policy for an infinite-horizon partially observable decision process.
We show that the discounted state-action frequencies and the expected cumulative reward are the functions of the policy, whereby the degree is determined by the degree of partial observability.
arXiv Detail & Related papers (2021-10-14T14:42:09Z) - Reinforcement Learning for Adaptive Mesh Refinement [63.7867809197671]
We propose a novel formulation of AMR as a Markov decision process and apply deep reinforcement learning to train refinement policies directly from simulation.
The model sizes of these policy architectures are independent of the mesh size and hence scale to arbitrarily large and complex simulations.
arXiv Detail & Related papers (2021-03-01T22:55:48Z) - The Variational Method of Moments [65.91730154730905]
conditional moment problem is a powerful formulation for describing structural causal parameters in terms of observables.
Motivated by a variational minimax reformulation of OWGMM, we define a very general class of estimators for the conditional moment problem.
We provide algorithms for valid statistical inference based on the same kind of variational reformulations.
arXiv Detail & Related papers (2020-12-17T07:21:06Z) - Conditional gradient methods for stochastically constrained convex
minimization [54.53786593679331]
We propose two novel conditional gradient-based methods for solving structured convex optimization problems.
The most important feature of our framework is that only a subset of the constraints is processed at each iteration.
Our algorithms rely on variance reduction and smoothing used in conjunction with conditional gradient steps, and are accompanied by rigorous convergence guarantees.
arXiv Detail & Related papers (2020-07-07T21:26:35Z) - Sparse recovery by reduced variance stochastic approximation [5.672132510411465]
We discuss application of iterative quadratic optimization routines to the problem of sparse signal recovery from noisy observation.
We show how one can straightforwardly enhance reliability of the corresponding solution by using Median-of-Means like techniques.
arXiv Detail & Related papers (2020-06-11T12:31:20Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.