Safe Online Dynamics Learning with Initially Unknown Models and
Infeasible Safety Certificates
- URL: http://arxiv.org/abs/2311.02133v1
- Date: Fri, 3 Nov 2023 14:23:57 GMT
- Title: Safe Online Dynamics Learning with Initially Unknown Models and
Infeasible Safety Certificates
- Authors: Alexandre Capone, Ryan Cosner, Aaron Ames, Sandra Hirche
- Abstract summary: This paper considers a learning-based setting with a robust safety certificate based on a control barrier function (CBF) second-order cone program.
If the control barrier function certificate is feasible, our approach leverages it to guarantee safety. Otherwise, our method explores the system dynamics to collect data and recover the feasibility of the control barrier function constraint.
- Score: 45.72598064481916
- License: http://creativecommons.org/publicdomain/zero/1.0/
- Abstract: Safety-critical control tasks with high levels of uncertainty are becoming
increasingly common. Typically, techniques that guarantee safety during
learning and control utilize constraint-based safety certificates, which can be
leveraged to compute safe control inputs. However, excessive model uncertainty
can render robust safety certification methods or infeasible, meaning no
control input satisfies the constraints imposed by the safety certificate. This
paper considers a learning-based setting with a robust safety certificate based
on a control barrier function (CBF) second-order cone program. If the control
barrier function certificate is feasible, our approach leverages it to
guarantee safety. Otherwise, our method explores the system dynamics to collect
data and recover the feasibility of the control barrier function constraint. To
this end, we employ a method inspired by well-established tools from Bayesian
optimization. We show that if the sampling frequency is high enough, we recover
the feasibility of the robust CBF certificate, guaranteeing safety. Our
approach requires no prior model and corresponds, to the best of our knowledge,
to the first algorithm that guarantees safety in settings with occasionally
infeasible safety certificates without requiring a backup non-learning-based
controller.
Related papers
- Transfer of Safety Controllers Through Learning Deep Inverse Dynamics Model [4.7962647777554634]
Control barrier certificates have proven effective in formally guaranteeing the safety of the control systems.
Design of a control barrier certificate is a time-consuming and computationally expensive endeavor.
We propose a validity condition that, when met, guarantees correctness of the controller.
arXiv Detail & Related papers (2024-05-22T15:28:43Z) - Sampling-based Safe Reinforcement Learning for Nonlinear Dynamical
Systems [15.863561935347692]
We develop provably safe and convergent reinforcement learning algorithms for control of nonlinear dynamical systems.
Recent advances at the intersection of control and RL follow a two-stage, safety filter approach to enforcing hard safety constraints.
We develop a single-stage, sampling-based approach to hard constraint satisfaction that learns RL controllers enjoying classical convergence guarantees.
arXiv Detail & Related papers (2024-03-06T19:39:20Z) - ISAACS: Iterative Soft Adversarial Actor-Critic for Safety [0.9217021281095907]
This work introduces a novel approach enabling scalable synthesis of robust safety-preserving controllers for robotic systems.
A safety-seeking fallback policy is co-trained with an adversarial "disturbance" agent that aims to invoke the worst-case realization of model error.
While the learned control policy does not intrinsically guarantee safety, it is used to construct a real-time safety filter.
arXiv Detail & Related papers (2022-12-06T18:53:34Z) - Meta-Learning Priors for Safe Bayesian Optimization [72.8349503901712]
We build on a meta-learning algorithm, F-PACOH, capable of providing reliable uncertainty quantification in settings of data scarcity.
As core contribution, we develop a novel framework for choosing safety-compliant priors in a data-riven manner.
On benchmark functions and a high-precision motion system, we demonstrate that our meta-learned priors accelerate the convergence of safe BO approaches.
arXiv Detail & Related papers (2022-10-03T08:38:38Z) - Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement
Learning in Unknown Stochastic Environments [84.3830478851369]
We propose a safe reinforcement learning approach that can jointly learn the environment and optimize the control policy.
Our approach can effectively enforce hard safety constraints and significantly outperform CMDP-based baseline methods in system safe rate measured via simulations.
arXiv Detail & Related papers (2022-09-29T20:49:25Z) - Recursively Feasible Probabilistic Safe Online Learning with Control Barrier Functions [60.26921219698514]
We introduce a model-uncertainty-aware reformulation of CBF-based safety-critical controllers.
We then present the pointwise feasibility conditions of the resulting safety controller.
We use these conditions to devise an event-triggered online data collection strategy.
arXiv Detail & Related papers (2022-08-23T05:02:09Z) - Safe Reinforcement Learning via Confidence-Based Filters [78.39359694273575]
We develop a control-theoretic approach for certifying state safety constraints for nominal policies learned via standard reinforcement learning techniques.
We provide formal safety guarantees, and empirically demonstrate the effectiveness of our approach.
arXiv Detail & Related papers (2022-07-04T11:43:23Z) - Joint Differentiable Optimization and Verification for Certified
Reinforcement Learning [91.93635157885055]
In model-based reinforcement learning for safety-critical control systems, it is important to formally certify system properties.
We propose a framework that jointly conducts reinforcement learning and formal verification.
arXiv Detail & Related papers (2022-01-28T16:53:56Z) - Joint Synthesis of Safety Certificate and Safe Control Policy using
Constrained Reinforcement Learning [7.658716383823426]
A valid safety certificate is an energy function indicating that safe states are with low energy.
Existing learning-based studies treat the safety certificate and the safe control policy as prior knowledge to learn the other.
This paper proposes a novel approach that simultaneously synthesizes the energy-function-based safety certificate and learns the safe control policy with CRL.
arXiv Detail & Related papers (2021-11-15T12:05:44Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.