Correct-by-construction reach-avoid control of partially observable
linear stochastic systems
- URL: http://arxiv.org/abs/2103.02398v4
- Date: Tue, 12 Sep 2023 07:48:23 GMT
- Title: Correct-by-construction reach-avoid control of partially observable
linear stochastic systems
- Authors: Thom Badings, Hasan A. Poonawala, Marielle Stoelinga, Nils Jansen
- Abstract summary: We formalize a robust feedback controller for reach-avoid control of discrete-time, linear time-invariant systems.
The problem is to compute a controller that satisfies the required provestate abstraction problem.
- Score: 7.912008109232803
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We study feedback controller synthesis for reach-avoid control of
discrete-time, linear time-invariant (LTI) systems with Gaussian process and
measurement noise. The problem is to compute a controller such that, with at
least some required probability, the system reaches a desired goal state in
finite time while avoiding unsafe states. Due to stochasticity and
nonconvexity, this problem does not admit exact algorithmic or closed-form
solutions in general. Our key contribution is a correct-by-construction
controller synthesis scheme based on a finite-state abstraction of a Gaussian
belief over the unmeasured state, obtained using a Kalman filter. We formalize
this abstraction as a Markov decision process (MDP). To be robust against
numerical imprecision in approximating transition probabilities, we use MDPs
with intervals of transition probabilities. By construction, any policy on the
abstraction can be refined into a piecewise linear feedback controller for the
LTI system. We prove that the closed-loop LTI system under this controller
satisfies the reach-avoid problem with at least the required probability. The
numerical experiments show that our method is able to solve reach-avoid
problems for systems with up to 6D state spaces, and with control input
constraints that cannot be handled by methods such as the rapidly-exploring
random belief trees (RRBT).
Related papers
- Robust Control for Dynamical Systems With Non-Gaussian Noise via Formal
Abstractions [59.605246463200736]
We present a novel controller synthesis method that does not rely on any explicit representation of the noise distributions.
First, we abstract the continuous control system into a finite-state model that captures noise by probabilistic transitions between discrete states.
We use state-of-the-art verification techniques to provide guarantees on the interval Markov decision process and compute a controller for which these guarantees carry over to the original control system.
arXiv Detail & Related papers (2023-01-04T10:40:30Z) - Formal Controller Synthesis for Markov Jump Linear Systems with
Uncertain Dynamics [64.72260320446158]
We propose a method for synthesising controllers for Markov jump linear systems.
Our method is based on a finite-state abstraction that captures both the discrete (mode-jumping) and continuous (stochastic linear) behaviour of the MJLS.
We apply our method to multiple realistic benchmark problems, in particular, a temperature control and an aerial vehicle delivery problem.
arXiv Detail & Related papers (2022-12-01T17:36:30Z) - Sampling-Based Robust Control of Autonomous Systems with Non-Gaussian
Noise [59.47042225257565]
We present a novel planning method that does not rely on any explicit representation of the noise distributions.
First, we abstract the continuous system into a discrete-state model that captures noise by probabilistic transitions between states.
We capture these bounds in the transition probability intervals of a so-called interval Markov decision process (iMDP)
arXiv Detail & Related papers (2021-10-25T06:18:55Z) - Finite-time System Identification and Adaptive Control in Autoregressive
Exogenous Systems [79.67879934935661]
We study the problem of system identification and adaptive control of unknown ARX systems.
We provide finite-time learning guarantees for the ARX systems under both open-loop and closed-loop data collection.
arXiv Detail & Related papers (2021-08-26T18:00:00Z) - Formal Verification of Stochastic Systems with ReLU Neural Network
Controllers [22.68044012584378]
We address the problem of formal safety verification for cyber-physical systems equipped with ReLU neural network (NN) controllers.
Our goal is to find the set of initial states from where, with a predetermined confidence, the system will not reach an unsafe configuration.
arXiv Detail & Related papers (2021-03-08T23:53:13Z) - Control of Stochastic Quantum Dynamics with Differentiable Programming [0.0]
We propose a framework for the automated design of control schemes based on differentiable programming.
We apply this approach to state preparation and stabilization of a qubit subjected to homodyne detection.
Despite the resulting poor signal-to-noise ratio, we can train our controller to prepare and stabilize the qubit to a target state with a mean fidelity around 85%.
arXiv Detail & Related papers (2021-01-04T19:00:03Z) - Learning Stabilizing Controllers for Unstable Linear Quadratic
Regulators from a Single Trajectory [85.29718245299341]
We study linear controllers under quadratic costs model also known as linear quadratic regulators (LQR)
We present two different semi-definite programs (SDP) which results in a controller that stabilizes all systems within an ellipsoid uncertainty set.
We propose an efficient data dependent algorithm -- textsceXploration -- that with high probability quickly identifies a stabilizing controller.
arXiv Detail & Related papers (2020-06-19T08:58:57Z) - Adaptive Control and Regret Minimization in Linear Quadratic Gaussian
(LQG) Setting [91.43582419264763]
We propose LqgOpt, a novel reinforcement learning algorithm based on the principle of optimism in the face of uncertainty.
LqgOpt efficiently explores the system dynamics, estimates the model parameters up to their confidence interval, and deploys the controller of the most optimistic model.
arXiv Detail & Related papers (2020-03-12T19:56:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.