Bayesian Algorithms Learn to Stabilize Unknown Continuous-Time Systems
- URL: http://arxiv.org/abs/2112.15094v1
- Date: Thu, 30 Dec 2021 15:31:35 GMT
- Title: Bayesian Algorithms Learn to Stabilize Unknown Continuous-Time Systems
- Authors: Mohamad Kazem Shirani Faradonbeh, Mohamad Sadegh Shirani Faradonbeh
- Abstract summary: Linear dynamical systems are canonical models for learning-based control of plants with uncertain dynamics.
A reliable stabilization procedure for this purpose that can effectively learn from unstable data to stabilize the system in a finite time is not currently available.
In this work, we propose a novel learning algorithm that stabilizes unknown continuous-time linear systems.
- Score: 0.0
- License: http://creativecommons.org/licenses/by/4.0/
- Abstract: Linear dynamical systems are canonical models for learning-based control of
plants with uncertain dynamics. The setting consists of a stochastic
differential equation that captures the state evolution of the plant
understudy, while the true dynamics matrices are unknown and need to be learned
from the observed data of state trajectory. An important issue is to ensure
that the system is stabilized and destabilizing control actions due to model
uncertainties are precluded as soon as possible. A reliable stabilization
procedure for this purpose that can effectively learn from unstable data to
stabilize the system in a finite time is not currently available. In this work,
we propose a novel Bayesian learning algorithm that stabilizes unknown
continuous-time stochastic linear systems. The presented algorithm is flexible
and exposes effective stabilization performance after a remarkably short time
period of interacting with the system.
Related papers
- Stochastic Reinforcement Learning with Stability Guarantees for Control of Unknown Nonlinear Systems [6.571209126567701]
We propose a reinforcement learning algorithm that stabilizes the system by learning a local linear representation ofthe dynamics.
We demonstrate the effectiveness of our algorithm on several challenging high-dimensional dynamical systems.
arXiv Detail & Related papers (2024-09-12T20:07:54Z) - Learning to Boost the Performance of Stable Nonlinear Systems [0.0]
We tackle the performance-boosting problem with closed-loop stability guarantees.
Our methods enable learning over arbitrarily deep neural network classes of performance-boosting controllers for stable nonlinear systems.
arXiv Detail & Related papers (2024-05-01T21:11:29Z) - Stability-Certified Learning of Control Systems with Quadratic
Nonlinearities [9.599029891108229]
This work primarily focuses on an operator inference methodology aimed at constructing low-dimensional dynamical models.
Our main objective is to develop a method that facilitates the inference of quadratic control dynamical systems with inherent stability guarantees.
arXiv Detail & Related papers (2024-03-01T16:26:47Z) - KCRL: Krasovskii-Constrained Reinforcement Learning with Guaranteed
Stability in Nonlinear Dynamical Systems [66.9461097311667]
We propose a model-based reinforcement learning framework with formal stability guarantees.
The proposed method learns the system dynamics up to a confidence interval using feature representation.
We show that KCRL is guaranteed to learn a stabilizing policy in a finite number of interactions with the underlying unknown system.
arXiv Detail & Related papers (2022-06-03T17:27:04Z) - Joint Learning-Based Stabilization of Multiple Unknown Linear Systems [3.453777970395065]
We propose a novel joint learning-based stabilization algorithm for quickly learning stabilizing policies for all systems understudy.
The presented procedure is shown to be notably effective such that it stabilizes the family of dynamical systems in an extremely short time period.
arXiv Detail & Related papers (2022-01-01T15:30:44Z) - Probabilistic robust linear quadratic regulators with Gaussian processes [73.0364959221845]
Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design.
We present a novel controller synthesis for linearized GP dynamics that yields robust controllers with respect to a probabilistic stability margin.
arXiv Detail & Related papers (2021-05-17T08:36:18Z) - Stability and Identification of Random Asynchronous Linear
Time-Invariant Systems [81.02274958043883]
We show the additional benefits of randomization and asynchrony on the stability of linear dynamical systems.
For unknown randomized LTI systems, we propose a systematic identification method to recover the underlying dynamics.
arXiv Detail & Related papers (2020-12-08T02:00:04Z) - Training Generative Adversarial Networks by Solving Ordinary
Differential Equations [54.23691425062034]
We study the continuous-time dynamics induced by GAN training.
From this perspective, we hypothesise that instabilities in training GANs arise from the integration error.
We experimentally verify that well-known ODE solvers (such as Runge-Kutta) can stabilise training.
arXiv Detail & Related papers (2020-10-28T15:23:49Z) - Robust Model-Free Learning and Control without Prior Knowledge [1.14219428942199]
We present a model-free control algorithm that robustly learn and stabilize an unknown discrete-time linear system.
The controller does not require any prior knowledge of the system dynamics, disturbances, or noise.
We will conclude with simulation results that show that despite the generality and simplicity, the controller demonstrates good closed-loop performance.
arXiv Detail & Related papers (2020-10-01T05:43:33Z) - Reinforcement Learning with Fast Stabilization in Linear Dynamical
Systems [91.43582419264763]
We study model-based reinforcement learning (RL) in unknown stabilizable linear dynamical systems.
We propose an algorithm that certifies fast stabilization of the underlying system by effectively exploring the environment.
We show that the proposed algorithm attains $tildemathcalO(sqrtT)$ regret after $T$ time steps of agent-environment interaction.
arXiv Detail & Related papers (2020-07-23T23:06:40Z) - Learning Stabilizing Controllers for Unstable Linear Quadratic
Regulators from a Single Trajectory [85.29718245299341]
We study linear controllers under quadratic costs model also known as linear quadratic regulators (LQR)
We present two different semi-definite programs (SDP) which results in a controller that stabilizes all systems within an ellipsoid uncertainty set.
We propose an efficient data dependent algorithm -- textsceXploration -- that with high probability quickly identifies a stabilizing controller.
arXiv Detail & Related papers (2020-06-19T08:58:57Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.