Online Control Barrier Functions for Decentralized Multi-Agent
Navigation
- URL: http://arxiv.org/abs/2303.04313v2
- Date: Fri, 8 Sep 2023 19:28:41 GMT
- Title: Online Control Barrier Functions for Decentralized Multi-Agent
Navigation
- Authors: Zhan Gao and Guang Yang and Amanda Prorok
- Abstract summary: Control barrier functions (CBFs) enable safe multi-agent navigation in the continuous domain.
Traditional approaches consider fixed CBFs, where parameters are tuned apriori.
We propose online CBFs, whereby hyper parameters are tuned in real-time.
- Score: 15.876920170393168
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Control barrier functions (CBFs) enable guaranteed safe multi-agent
navigation in the continuous domain. The resulting navigation performance,
however, is highly sensitive to the underlying hyperparameters. Traditional
approaches consider fixed CBFs (where parameters are tuned apriori), and hence,
typically do not perform well in cluttered and highly dynamic environments:
conservative parameter values can lead to inefficient agent trajectories, or
even failure to reach goal positions, whereas aggressive parameter values can
lead to infeasible controls. To overcome these issues, in this paper, we
propose online CBFs, whereby hyperparameters are tuned in real-time, as a
function of what agents perceive in their immediate neighborhood. Since the
explicit relationship between CBFs and navigation performance is hard to model,
we leverage reinforcement learning to learn CBF-tuning policies in a model-free
manner. Because we parameterize the policies with graph neural networks (GNNs),
we are able to synthesize decentralized agent controllers that adjust parameter
values locally, varying the degree of conservative and aggressive behaviors
across agents. Simulations as well as real-world experiments show that (i)
online CBFs are capable of solving navigation scenarios that are infeasible for
fixed CBFs, and (ii), that they improve navigation performance by adapting to
other agents and changes in the environment.
Related papers
- Domain Adaptive Safety Filters via Deep Operator Learning [5.62479170374811]
We propose a self-supervised deep operator learning framework that learns the mapping from environmental parameters to the corresponding CBF.
We demonstrate the effectiveness of the method through numerical experiments on navigation tasks involving dynamic obstacles.
arXiv Detail & Related papers (2024-10-18T15:10:55Z) - Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems [14.166970599802324]
Optimal control methods provide solutions to safety-critical problems but easily become intractable.
We propose a Reinforcement Learning-based Receding Horizon Control approach leveraging Model Predictive Control.
We validate our method by applying it to the challenging automated merging control problem for Connected and Automated Vehicles.
arXiv Detail & Related papers (2024-03-26T02:49:08Z) - Leveraging Low-Rank and Sparse Recurrent Connectivity for Robust
Closed-Loop Control [63.310780486820796]
We show how a parameterization of recurrent connectivity influences robustness in closed-loop settings.
We find that closed-form continuous-time neural networks (CfCs) with fewer parameters can outperform their full-rank, fully-connected counterparts.
arXiv Detail & Related papers (2023-10-05T21:44:18Z) - Safe Neural Control for Non-Affine Control Systems with Differentiable
Control Barrier Functions [58.19198103790931]
This paper addresses the problem of safety-critical control for non-affine control systems.
It has been shown that optimizing quadratic costs subject to state and control constraints can be sub-optimally reduced to a sequence of quadratic programs (QPs) by using Control Barrier Functions (CBFs)
We incorporate higher-order CBFs into neural ordinary differential equation-based learning models as differentiable CBFs to guarantee safety for non-affine control systems.
arXiv Detail & Related papers (2023-09-06T05:35:48Z) - Meta-Learning Priors for Safe Bayesian Optimization [72.8349503901712]
We build on a meta-learning algorithm, F-PACOH, capable of providing reliable uncertainty quantification in settings of data scarcity.
As core contribution, we develop a novel framework for choosing safety-compliant priors in a data-riven manner.
On benchmark functions and a high-precision motion system, we demonstrate that our meta-learned priors accelerate the convergence of safe BO approaches.
arXiv Detail & Related papers (2022-10-03T08:38:38Z) - BarrierNet: A Safety-Guaranteed Layer for Neural Networks [50.86816322277293]
BarrierNet allows the safety constraints of a neural controller be adaptable to changing environments.
We evaluate them on a series of control problems such as traffic merging and robot navigations in 2D and 3D space.
arXiv Detail & Related papers (2021-11-22T15:38:11Z) - Pointwise Feasibility of Gaussian Process-based Safety-Critical Control
under Model Uncertainty [77.18483084440182]
Control Barrier Functions (CBFs) and Control Lyapunov Functions (CLFs) are popular tools for enforcing safety and stability of a controlled system, respectively.
We present a Gaussian Process (GP)-based approach to tackle the problem of model uncertainty in safety-critical controllers that use CBFs and CLFs.
arXiv Detail & Related papers (2021-06-13T23:08:49Z) - Learning Control Barrier Functions from Expert Demonstrations [69.23675822701357]
We propose a learning based approach to safe controller synthesis based on control barrier functions (CBFs)
We analyze an optimization-based approach to learning a CBF that enjoys provable safety guarantees under suitable Lipschitz assumptions on the underlying dynamical system.
To the best of our knowledge, these are the first results that learn provably safe control barrier functions from data.
arXiv Detail & Related papers (2020-04-07T12:29:06Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.