Optimization-Based MCMC Methods for Nonlinear Hierarchical Statistical
Inverse Problems
- URL: http://arxiv.org/abs/2002.06358v1
- Date: Sat, 15 Feb 2020 10:19:42 GMT
- Title: Optimization-Based MCMC Methods for Nonlinear Hierarchical Statistical
Inverse Problems
- Authors: Johnathan Bardsley, Tiangang Cui
- Abstract summary: In many hierarchical inverse problems, not only do we want to estimate high- or infinite-dimensional model parameters in the parameter-to-observable maps, but we also have to estimate hyper parameters.
In this work, we aim to develop scalable optimization-based Markov chain Monte Carlo (MCMC) methods for solving hierarchical Bayesian inverse problems.
- Score: 0.6091702876917279
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In many hierarchical inverse problems, not only do we want to estimate high-
or infinite-dimensional model parameters in the parameter-to-observable maps,
but we also have to estimate hyperparameters that represent critical
assumptions in the statistical and mathematical modeling processes. As a joint
effect of high-dimensionality, nonlinear dependence, and non-concave structures
in the joint posterior posterior distribution over model parameters and
hyperparameters, solving inverse problems in the hierarchical Bayesian setting
poses a significant computational challenge. In this work, we aim to develop
scalable optimization-based Markov chain Monte Carlo (MCMC) methods for solving
hierarchical Bayesian inverse problems with nonlinear parameter-to-observable
maps and a broader class of hyperparameters. Our algorithmic development is
based on the recently developed scalable randomize-then-optimize (RTO) method
[4] for exploring the high- or infinite-dimensional model parameter space. By
using RTO either as a proposal distribution in a Metropolis-within-Gibbs update
or as a biasing distribution in the pseudo-marginal MCMC [2], we are able to
design efficient sampling tools for hierarchical Bayesian inversion. In
particular, the integration of RTO and the pseudo-marginal MCMC has sampling
performance robust to model parameter dimensions. We also extend our methods to
nonlinear inverse problems with Poisson-distributed measurements. Numerical
examples in PDE-constrained inverse problems and positron emission tomography
(PET) are used to demonstrate the performance of our methods.
Related papers
- A Statistical Machine Learning Approach for Adapting Reduced-Order Models using Projected Gaussian Process [4.658371840624581]
Proper Orthogonal Decomposition (POD) computes optimal basis modes that span a low-dimensional subspace where the Reduced-Order Models (ROMs) reside.
This paper proposes a Projected Gaussian Process (pGP) and formulates the problem of adapting POD basis as a supervised statistical learning problem.
Numerical examples are presented to demonstrate the advantages of the proposed pGP for adapting POD basis against parameter changes.
arXiv Detail & Related papers (2024-10-18T00:02:43Z) - Total Uncertainty Quantification in Inverse PDE Solutions Obtained with Reduced-Order Deep Learning Surrogate Models [50.90868087591973]
We propose an approximate Bayesian method for quantifying the total uncertainty in inverse PDE solutions obtained with machine learning surrogate models.
We test the proposed framework by comparing it with the iterative ensemble smoother and deep ensembling methods for a non-linear diffusion equation.
arXiv Detail & Related papers (2024-08-20T19:06:02Z) - A Metaheuristic for Amortized Search in High-Dimensional Parameter
Spaces [0.0]
We propose a new metaheuristic that drives dimensionality reductions from feature-informed transformations.
DR-FFIT implements an efficient sampling strategy that facilitates a gradient-free parameter search in high-dimensional spaces.
Our test data show that DR-FFIT boosts the performances of random-search and simulated-annealing against well-established metaheuristics.
arXiv Detail & Related papers (2023-09-28T14:25:14Z) - Provably Efficient Algorithm for Nonstationary Low-Rank MDPs [48.92657638730582]
We make the first effort to investigate nonstationary RL under episodic low-rank MDPs, where both transition kernels and rewards may vary over time.
We propose a parameter-dependent policy optimization algorithm called PORTAL, and further improve PORTAL to its parameter-free version of Ada-PORTAL.
For both algorithms, we provide upper bounds on the average dynamic suboptimality gap, which show that as long as the nonstationarity is not significantly large, PORTAL and Ada-PORTAL are sample-efficient and can achieve arbitrarily small average dynamic suboptimality gap with sample complexity.
arXiv Detail & Related papers (2023-08-10T09:52:44Z) - An iterative multi-fidelity approach for model order reduction of
multi-dimensional input parametric PDE systems [0.0]
We propose a sampling parametric strategy for the reduction of large-scale PDE systems with multidimensional input parametric spaces.
It is achieved by exploiting low-fidelity models throughout the parametric space to sample points using an efficient sampling strategy.
Since the proposed methodology leverages the use of low-fidelity models to assimilate the solution database, it significantly reduces the computational cost in the offline stage.
arXiv Detail & Related papers (2023-01-23T15:25:58Z) - Optimization of Annealed Importance Sampling Hyperparameters [77.34726150561087]
Annealed Importance Sampling (AIS) is a popular algorithm used to estimates the intractable marginal likelihood of deep generative models.
We present a parameteric AIS process with flexible intermediary distributions and optimize the bridging distributions to use fewer number of steps for sampling.
We assess the performance of our optimized AIS for marginal likelihood estimation of deep generative models and compare it to other estimators.
arXiv Detail & Related papers (2022-09-27T07:58:25Z) - Counting Phases and Faces Using Bayesian Thermodynamic Integration [77.34726150561087]
We introduce a new approach to reconstruction of the thermodynamic functions and phase boundaries in two-parametric statistical mechanics systems.
We use the proposed approach to accurately reconstruct the partition functions and phase diagrams of the Ising model and the exactly solvable non-equilibrium TASEP.
arXiv Detail & Related papers (2022-05-18T17:11:23Z) - Bayesian multiscale deep generative model for the solution of
high-dimensional inverse problems [0.0]
A novel multiscale Bayesian inference approach is introduced based on deep probabilistic generative models.
The method allows high-dimensional parameter estimation while exhibiting stability, efficiency and accuracy.
arXiv Detail & Related papers (2021-02-04T11:47:21Z) - Pushing the Envelope of Rotation Averaging for Visual SLAM [69.7375052440794]
We propose a novel optimization backbone for visual SLAM systems.
We leverage averaging to improve the accuracy, efficiency and robustness of conventional monocular SLAM systems.
Our approach can exhibit up to 10x faster with comparable accuracy against the state-art on public benchmarks.
arXiv Detail & Related papers (2020-11-02T18:02:26Z) - Understanding Implicit Regularization in Over-Parameterized Single Index
Model [55.41685740015095]
We design regularization-free algorithms for the high-dimensional single index model.
We provide theoretical guarantees for the induced implicit regularization phenomenon.
arXiv Detail & Related papers (2020-07-16T13:27:47Z) - Bayesian System ID: Optimal management of parameter, model, and
measurement uncertainty [0.0]
We evaluate the robustness of a probabilistic formulation of system identification (ID) to sparse, noisy, and indirect data.
We show that the log posterior has improved geometric properties compared with the objective function surfaces of traditional methods.
arXiv Detail & Related papers (2020-03-04T22:48:30Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.