Networked Online Learning for Control of Safety-Critical
Resource-Constrained Systems based on Gaussian Processes
- URL: http://arxiv.org/abs/2202.11491v1
- Date: Wed, 23 Feb 2022 13:12:12 GMT
- Title: Networked Online Learning for Control of Safety-Critical
Resource-Constrained Systems based on Gaussian Processes
- Authors: Armin Lederer, Mingmin Zhang, Samuel Tesfazgi, Sandra Hirche
- Abstract summary: We propose a novel networked online learning approach based on Gaussian process regression.
We propose an effective data transmission scheme between the local system and the cloud taking bandwidth limitations and time delay of the transmission channel into account.
- Score: 9.544146562919792
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Safety-critical technical systems operating in unknown environments require
the ability to quickly adapt their behavior, which can be achieved in control
by inferring a model online from the data stream generated during operation.
Gaussian process-based learning is particularly well suited for safety-critical
applications as it ensures bounded prediction errors. While there exist
computationally efficient approximations for online inference, these approaches
lack guarantees for the prediction error and have high memory requirements, and
are therefore not applicable to safety-critical systems with tight memory
constraints. In this work, we propose a novel networked online learning
approach based on Gaussian process regression, which addresses the issue of
limited local resources by employing remote data management in the cloud. Our
approach formally guarantees a bounded tracking error with high probability,
which is exploited to identify the most relevant data to achieve a certain
control performance. We further propose an effective data transmission scheme
between the local system and the cloud taking bandwidth limitations and time
delay of the transmission channel into account. The effectiveness of the
proposed method is successfully demonstrated in a simulation.
Related papers
- Privacy-Preserving Distributed Learning for Residential Short-Term Load
Forecasting [11.185176107646956]
Power system load data can inadvertently reveal the daily routines of residential users, posing a risk to their property security.
We introduce a Markovian Switching-based distributed training framework, the convergence of which is substantiated through rigorous theoretical analysis.
Case studies employing real-world power system load data validate the efficacy of our proposed algorithm.
arXiv Detail & Related papers (2024-02-02T16:39:08Z) - Verification of Neural Reachable Tubes via Scenario Optimization and Conformal Prediction [10.40899456282141]
Hamilton-Jacobi reachability analysis is a popular formal verification tool for providing such guarantees.
DeepReach has been used to synthesize reachable tubes and safety controllers for high-dimensional systems.
We propose two verification methods, based on robust scenario optimization and conformal prediction, to provide probabilistic safety guarantees.
arXiv Detail & Related papers (2023-12-14T02:03:36Z) - In-Distribution Barrier Functions: Self-Supervised Policy Filters that
Avoid Out-of-Distribution States [84.24300005271185]
We propose a control filter that wraps any reference policy and effectively encourages the system to stay in-distribution with respect to offline-collected safe demonstrations.
Our method is effective for two different visuomotor control tasks in simulation environments, including both top-down and egocentric view settings.
arXiv Detail & Related papers (2023-01-27T22:28:19Z) - Recursively Feasible Probabilistic Safe Online Learning with Control Barrier Functions [60.26921219698514]
We introduce a model-uncertainty-aware reformulation of CBF-based safety-critical controllers.
We then present the pointwise feasibility conditions of the resulting safety controller.
We use these conditions to devise an event-triggered online data collection strategy.
arXiv Detail & Related papers (2022-08-23T05:02:09Z) - Sample-efficient Safe Learning for Online Nonlinear Control with Control
Barrier Functions [35.9713619595494]
Reinforcement Learning and continuous nonlinear control have been successfully deployed in multiple domains of complicated sequential decision-making tasks.
Given the exploration nature of the learning process and the presence of model uncertainty, it is challenging to apply them to safety-critical control tasks.
We propose a emphprovably efficient episodic safe learning framework for online control tasks.
arXiv Detail & Related papers (2022-07-29T00:54:35Z) - Log Barriers for Safe Black-box Optimization with Application to Safe
Reinforcement Learning [72.97229770329214]
We introduce a general approach for seeking high dimensional non-linear optimization problems in which maintaining safety during learning is crucial.
Our approach called LBSGD is based on applying a logarithmic barrier approximation with a carefully chosen step size.
We demonstrate the effectiveness of our approach on minimizing violation in policy tasks in safe reinforcement learning.
arXiv Detail & Related papers (2022-07-21T11:14:47Z) - Learning Robust Output Control Barrier Functions from Safe Expert Demonstrations [50.37808220291108]
This paper addresses learning safe output feedback control laws from partial observations of expert demonstrations.
We first propose robust output control barrier functions (ROCBFs) as a means to guarantee safety.
We then formulate an optimization problem to learn ROCBFs from expert demonstrations that exhibit safe system behavior.
arXiv Detail & Related papers (2021-11-18T23:21:00Z) - Model-Free Learning of Optimal Deterministic Resource Allocations in
Wireless Systems via Action-Space Exploration [4.721069729610892]
We propose a technically grounded and scalable deterministic-dual gradient policy method for efficiently learning optimal parameterized resource allocation policies.
Our method not only efficiently exploits gradient availability of popular universal representations such as deep networks, but is also truly model-free, as it relies on consistent zeroth-order gradient approximations of associated random network services constructed via low-dimensional perturbations in action space.
arXiv Detail & Related papers (2021-08-23T18:26:16Z) - Probabilistic robust linear quadratic regulators with Gaussian processes [73.0364959221845]
Probabilistic models such as Gaussian processes (GPs) are powerful tools to learn unknown dynamical systems from data for subsequent use in control design.
We present a novel controller synthesis for linearized GP dynamics that yields robust controllers with respect to a probabilistic stability margin.
arXiv Detail & Related papers (2021-05-17T08:36:18Z) - Non-Episodic Learning for Online LQR of Unknown Linear Gaussian System [0.0]
We propose an online non-episodic algorithm that gains knowledge about the system from a single trajectory.
We characterize the almost sure convergence rates of identification and control, and reveal an optimal trade-off between exploration and exploitation.
arXiv Detail & Related papers (2021-03-24T15:51:28Z) - Chance-Constrained Trajectory Optimization for Safe Exploration and
Learning of Nonlinear Systems [81.7983463275447]
Learning-based control algorithms require data collection with abundant supervision for training.
We present a new approach for optimal motion planning with safe exploration that integrates chance-constrained optimal control with dynamics learning and feedback control.
arXiv Detail & Related papers (2020-05-09T05:57:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.