Real-Time Adaptive Safety-Critical Control with Gaussian Processes in
High-Order Uncertain Models
- URL: http://arxiv.org/abs/2402.18946v2
- Date: Tue, 5 Mar 2024 09:00:29 GMT
- Title: Real-Time Adaptive Safety-Critical Control with Gaussian Processes in
High-Order Uncertain Models
- Authors: Yu Zhang, Long Wen, Xiangtong Yao, Zhenshan Bing, Linghuan Kong, Wei
He, and Alois Knoll
- Abstract summary: This paper presents an adaptive online learning framework for systems with uncertain parameters.
We first integrate a forgetting factor to refine a variational sparse GP algorithm.
In the second phase, we propose a safety filter based on high-order control barrier functions.
- Score: 14.790031018404942
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: This paper presents an adaptive online learning framework for systems with
uncertain parameters to ensure safety-critical control in non-stationary
environments. Our approach consists of two phases. The initial phase is
centered on a novel sparse Gaussian process (GP) framework. We first integrate
a forgetting factor to refine a variational sparse GP algorithm, thus enhancing
its adaptability. Subsequently, the hyperparameters of the Gaussian model are
trained with a specially compound kernel, and the Gaussian model's online
inferential capability and computational efficiency are strengthened by
updating a solitary inducing point derived from new samples, in conjunction
with the learned hyperparameters. In the second phase, we propose a safety
filter based on high-order control barrier functions (HOCBFs), synergized with
the previously trained learning model. By leveraging the compound kernel from
the first phase, we effectively address the inherent limitations of GPs in
handling high-dimensional problems for real-time applications. The derived
controller ensures a rigorous lower bound on the probability of satisfying the
safety specification. Finally, the efficacy of our proposed algorithm is
demonstrated through real-time obstacle avoidance experiments executed using
both a simulation platform and a real-world 7-DOF robot.
Related papers
- Towards Continual Learning Desiderata via HSIC-Bottleneck
Orthogonalization and Equiangular Embedding [55.107555305760954]
We propose a conceptually simple yet effective method that attributes forgetting to layer-wise parameter overwriting and the resulting decision boundary distortion.
Our method achieves competitive accuracy performance, even with absolute superiority of zero exemplar buffer and 1.02x the base model.
arXiv Detail & Related papers (2024-01-17T09:01:29Z) - Online Probabilistic Model Identification using Adaptive Recursive MCMC [8.465242072268019]
We suggest the Adaptive Recursive Markov Chain Monte Carlo (ARMCMC) method.
It eliminates the shortcomings of conventional online techniques while computing the entire probability density function of model parameters.
We demonstrate our approach using parameter estimation in a soft bending actuator and the Hunt-Crossley dynamic model.
arXiv Detail & Related papers (2022-10-23T02:06:48Z) - Log Barriers for Safe Black-box Optimization with Application to Safe
Reinforcement Learning [72.97229770329214]
We introduce a general approach for seeking high dimensional non-linear optimization problems in which maintaining safety during learning is crucial.
Our approach called LBSGD is based on applying a logarithmic barrier approximation with a carefully chosen step size.
We demonstrate the effectiveness of our approach on minimizing violation in policy tasks in safe reinforcement learning.
arXiv Detail & Related papers (2022-07-21T11:14:47Z) - Gaussian Process Uniform Error Bounds with Unknown Hyperparameters for
Safety-Critical Applications [71.23286211775084]
We introduce robust Gaussian process uniform error bounds in settings with unknown hyper parameters.
Our approach computes a confidence region in the space of hyper parameters, which enables us to obtain a probabilistic upper bound for the model error.
Experiments show that the bound performs significantly better than vanilla and fully Bayesian processes.
arXiv Detail & Related papers (2021-09-06T17:10:01Z) - Hybrid Gaussian Process Modeling Applied to Economic Stochastic Model
Predictive Control of Batch Processes [0.0]
Plant models can often be determined from first principles, parts of the model are difficult to derive using physical laws alone.
This paper exploits GPs to model the parts of the dynamic system that are difficult to describe using first principles.
It is vital to account for this uncertainty in the control algorithm, to prevent constraint violations and performance deterioration.
arXiv Detail & Related papers (2021-08-14T00:01:42Z) - Derivative-Free Policy Optimization for Risk-Sensitive and Robust
Control Design: Implicit Regularization and Sample Complexity [15.940861063732608]
Direct policy search serves as one of the workhorses in modern reinforcement learning (RL)
We investigate the convergence theory of policy robustness (PG) methods for the linear risk-sensitive and robust controller.
One feature of our algorithms is that during the learning phase, a certain level complexity/risk-sensitivity controller is preserved.
arXiv Detail & Related papers (2021-01-04T16:00:46Z) - Gaussian Process-based Min-norm Stabilizing Controller for
Control-Affine Systems with Uncertain Input Effects and Dynamics [90.81186513537777]
We propose a novel compound kernel that captures the control-affine nature of the problem.
We show that this resulting optimization problem is convex, and we call it Gaussian Process-based Control Lyapunov Function Second-Order Cone Program (GP-CLF-SOCP)
arXiv Detail & Related papers (2020-11-14T01:27:32Z) - Learning Control Barrier Functions from Expert Demonstrations [69.23675822701357]
We propose a learning based approach to safe controller synthesis based on control barrier functions (CBFs)
We analyze an optimization-based approach to learning a CBF that enjoys provable safety guarantees under suitable Lipschitz assumptions on the underlying dynamical system.
To the best of our knowledge, these are the first results that learn provably safe control barrier functions from data.
arXiv Detail & Related papers (2020-04-07T12:29:06Z) - Adaptive Control and Regret Minimization in Linear Quadratic Gaussian
(LQG) Setting [91.43582419264763]
We propose LqgOpt, a novel reinforcement learning algorithm based on the principle of optimism in the face of uncertainty.
LqgOpt efficiently explores the system dynamics, estimates the model parameters up to their confidence interval, and deploys the controller of the most optimistic model.
arXiv Detail & Related papers (2020-03-12T19:56:38Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.