Real-Time Adaptive Safety-Critical Control with Gaussian Processes in
High-Order Uncertain Models
- URL: http://arxiv.org/abs/2402.18946v2
- Date: Tue, 5 Mar 2024 09:00:29 GMT
- Title: Real-Time Adaptive Safety-Critical Control with Gaussian Processes in
High-Order Uncertain Models
- Authors: Yu Zhang, Long Wen, Xiangtong Yao, Zhenshan Bing, Linghuan Kong, Wei
He, and Alois Knoll
- Abstract summary: This paper presents an adaptive online learning framework for systems with uncertain parameters.
We first integrate a forgetting factor to refine a variational sparse GP algorithm.
In the second phase, we propose a safety filter based on high-order control barrier functions.
- Score: 14.790031018404942
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents an adaptive online learning framework for systems with
uncertain parameters to ensure safety-critical control in non-stationary
environments. Our approach consists of two phases. The initial phase is
centered on a novel sparse Gaussian process (GP) framework. We first integrate
a forgetting factor to refine a variational sparse GP algorithm, thus enhancing
its adaptability. Subsequently, the hyperparameters of the Gaussian model are
trained with a specially compound kernel, and the Gaussian model's online
inferential capability and computational efficiency are strengthened by
updating a solitary inducing point derived from new samples, in conjunction
with the learned hyperparameters. In the second phase, we propose a safety
filter based on high-order control barrier functions (HOCBFs), synergized with
the previously trained learning model. By leveraging the compound kernel from
the first phase, we effectively address the inherent limitations of GPs in
handling high-dimensional problems for real-time applications. The derived
controller ensures a rigorous lower bound on the probability of satisfying the
safety specification. Finally, the efficacy of our proposed algorithm is
demonstrated through real-time obstacle avoidance experiments executed using
both a simulation platform and a real-world 7-DOF robot.
Related papers
- Towards safe and tractable Gaussian process-based MPC: Efficient sampling within a sequential quadratic programming framework [35.79393879150088]
We propose a robust GP-MPC formulation that guarantees constraint satisfaction with high probability.
We highlight the improved reachable set approximation compared to existing methods, as well as real-time feasible times.
arXiv Detail & Related papers (2024-09-13T08:15:20Z) - Safe Bayesian Optimization for High-Dimensional Control Systems via Additive Gaussian Processes [2.3342885570554652]
We propose a high-dimensional safe Bayesian optimization method based on additive Gaussian processes to optimize multiple controllers simultaneously and safely.
Experimental results on a permanent magnet synchronous motor (PMSM) demonstrate that our method can obtain optimal parameters more efficiently while ensuring safety.
arXiv Detail & Related papers (2024-08-29T07:12:37Z) - Towards Continual Learning Desiderata via HSIC-Bottleneck
Orthogonalization and Equiangular Embedding [55.107555305760954]
We propose a conceptually simple yet effective method that attributes forgetting to layer-wise parameter overwriting and the resulting decision boundary distortion.
Our method achieves competitive accuracy performance, even with absolute superiority of zero exemplar buffer and 1.02x the base model.
arXiv Detail & Related papers (2024-01-17T09:01:29Z) - Log Barriers for Safe Black-box Optimization with Application to Safe
Reinforcement Learning [72.97229770329214]
We introduce a general approach for seeking high dimensional non-linear optimization problems in which maintaining safety during learning is crucial.
Our approach called LBSGD is based on applying a logarithmic barrier approximation with a carefully chosen step size.
We demonstrate the effectiveness of our approach on minimizing violation in policy tasks in safe reinforcement learning.
arXiv Detail & Related papers (2022-07-21T11:14:47Z) - Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for
Safety-Critical Applications [71.23286211775084]
We introduce robust Gaussian process uniform error bounds in settings with unknown hyper parameters.
Our approach computes a confidence region in the space of hyper parameters, which enables us to obtain a probabilistic upper bound for the model error.
Experiments show that the bound performs significantly better than vanilla and fully Bayesian processes.
arXiv Detail & Related papers (2021-09-06T17:10:01Z) - Hybrid Gaussian Process Modeling Applied to Economic Stochastic Model
Predictive Control of Batch Processes [0.0]
Plant models can often be determined from first principles, parts of the model are difficult to derive using physical laws alone.
This paper exploits GPs to model the parts of the dynamic system that are difficult to describe using first principles.
It is vital to account for this uncertainty in the control algorithm, to prevent constraint violations and performance deterioration.
arXiv Detail & Related papers (2021-08-14T00:01:42Z) - Gaussian Process-based Min-norm Stabilizing Controller for
Control-Affine Systems with Uncertain Input Effects and Dynamics [90.81186513537777]
We propose a novel compound kernel that captures the control-affine nature of the problem.
We show that this resulting optimization problem is convex, and we call it Gaussian Process-based Control Lyapunov Function Second-Order Cone Program (GP-CLF-SOCP)
arXiv Detail & Related papers (2020-11-14T01:27:32Z) - Learning Control Barrier Functions from Expert Demonstrations [69.23675822701357]
We propose a learning based approach to safe controller synthesis based on control barrier functions (CBFs)
We analyze an optimization-based approach to learning a CBF that enjoys provable safety guarantees under suitable Lipschitz assumptions on the underlying dynamical system.
To the best of our knowledge, these are the first results that learn provably safe control barrier functions from data.
arXiv Detail & Related papers (2020-04-07T12:29:06Z) - Adaptive Control and Regret Minimization in Linear Quadratic Gaussian
(LQG) Setting [91.43582419264763]
We propose LqgOpt, a novel reinforcement learning algorithm based on the principle of optimism in the face of uncertainty.
LqgOpt efficiently explores the system dynamics, estimates the model parameters up to their confidence interval, and deploys the controller of the most optimistic model.
arXiv Detail & Related papers (2020-03-12T19:56:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.