Probabilistic Control and Majorization of Optimal Control
- URL: http://arxiv.org/abs/2205.03279v5
- Date: Wed, 15 Nov 2023 10:43:40 GMT
- Title: Probabilistic Control and Majorization of Optimal Control
- Authors: Tom Lefebvre
- Abstract summary: Probabilistic control design is founded on the principle that a rational agent attempts to match modelled with an arbitrary desired closed-loop system trajectory density.
In this work we introduce an alternative parametrization of desired closed-loop behaviour and explore alternative proximity measures between densities.
- Score: 3.2634122554914002
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: Probabilistic control design is founded on the principle that a rational
agent attempts to match modelled with an arbitrary desired closed-loop system
trajectory density. The framework was originally proposed as a tractable
alternative to traditional optimal control design, parametrizing desired
behaviour through fictitious transition and policy densities and using the
information projection as a proximity measure. In this work we introduce an
alternative parametrization of desired closed-loop behaviour and explore
alternative proximity measures between densities. It is then illustrated how
the associated probabilistic control problems solve into uncertain or
probabilistic policies. Our main result is to show that the probabilistic
control objectives majorize conventional, stochastic and risk sensitive,
optimal control objectives. This observation allows us to identify two
probabilistic fixed point iterations that converge to the deterministic optimal
control policies establishing an explicit connection between either
formulations. Further we demonstrate that the risk sensitive optimal control
formulation is also technically equivalent to a Maximum Likelihood estimation
problem on a probabilistic graph model where the notion of costs is directly
encoded into the model. The associated treatment of the estimation problem is
then shown to coincide with the moment projected probabilistic control
formulation. That way optimal decision making can be reformulated as an
iterative inference problem. Based on these insights we discuss directions for
algorithmic development.
Related papers
- A successive approximation method in functional spaces for hierarchical optimal control problems and its application to learning [0.0]
We consider a class of learning problem of point estimation for modeling high-dimensional nonlinear functions.
The estimated parameter in due course provides an acceptable prediction accuracy on a different model validation dataset.
We provide a framework how to account appropriately for both generalization and regularization at the optimization stage.
arXiv Detail & Related papers (2024-10-27T22:28:07Z) - Likelihood Ratio Confidence Sets for Sequential Decision Making [51.66638486226482]
We revisit the likelihood-based inference principle and propose to use likelihood ratios to construct valid confidence sequences.
Our method is especially suitable for problems with well-specified likelihoods.
We show how to provably choose the best sequence of estimators and shed light on connections to online convex optimization.
arXiv Detail & Related papers (2023-11-08T00:10:21Z) - Model Predictive Control with Gaussian-Process-Supported Dynamical
Constraints for Autonomous Vehicles [82.65261980827594]
We propose a model predictive control approach for autonomous vehicles that exploits learned Gaussian processes for predicting human driving behavior.
A multi-mode predictive control approach considers the possible intentions of the human drivers.
arXiv Detail & Related papers (2023-03-08T17:14:57Z) - Probabilistic learning constrained by realizations using a weak
formulation of Fourier transform of probability measures [0.0]
This paper deals with the taking into account a given set of realizations as constraints in the Kullback-Leibler minimum principle.
A functional approach is developed on the basis of a weak formulation of the Fourier transform of probability measures.
The presented application in high dimension demonstrates the efficiency and the robustness of the proposed algorithm.
arXiv Detail & Related papers (2022-05-06T08:54:57Z) - Integrated Conditional Estimation-Optimization [6.037383467521294]
Many real-world optimization problems uncertain parameters with probability can be estimated using contextual feature information.
In contrast to the standard approach of estimating the distribution of uncertain parameters, we propose an integrated conditional estimation approach.
We show that our ICEO approach is theally consistent under moderate conditions.
arXiv Detail & Related papers (2021-10-24T04:49:35Z) - Stochastic Control through Approximate Bayesian Input Inference [23.65155934960922]
Optimal control under uncertainty is a prevailing challenge in control, due to the difficulty in producing tractable solutions for the optimization problem.
By framing the control problem as one of input estimation, advanced approximate inference techniques can be used to handle the statistical approximations in a principled and practical manner.
arXiv Detail & Related papers (2021-05-17T09:27:12Z) - Probabilistic robust linear quadratic regulators with Gaussian processes [73.0364959221845]
Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design.
We present a novel controller synthesis for linearized GP dynamics that yields robust controllers with respect to a probabilistic stability margin.
arXiv Detail & Related papers (2021-05-17T08:36:18Z) - Stein Variational Model Predictive Control [130.60527864489168]
Decision making under uncertainty is critical to real-world, autonomous systems.
Model Predictive Control (MPC) methods have demonstrated favorable performance in practice, but remain limited when dealing with complex distributions.
We show that this framework leads to successful planning in challenging, non optimal control problems.
arXiv Detail & Related papers (2020-11-15T22:36:59Z) - Amortized Conditional Normalized Maximum Likelihood: Reliable Out of
Distribution Uncertainty Estimation [99.92568326314667]
We propose the amortized conditional normalized maximum likelihood (ACNML) method as a scalable general-purpose approach for uncertainty estimation.
Our algorithm builds on the conditional normalized maximum likelihood (CNML) coding scheme, which has minimax optimal properties according to the minimum description length principle.
We demonstrate that ACNML compares favorably to a number of prior techniques for uncertainty estimation in terms of calibration on out-of-distribution inputs.
arXiv Detail & Related papers (2020-11-05T08:04:34Z) - Control as Hybrid Inference [62.997667081978825]
We present an implementation of CHI which naturally mediates the balance between iterative and amortised inference.
We verify the scalability of our algorithm on a continuous control benchmark, demonstrating that it outperforms strong model-free and model-based baselines.
arXiv Detail & Related papers (2020-07-11T19:44:09Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.