Active Uncertainty Reduction for Safe and Efficient Interaction
Planning: A Shielding-Aware Dual Control Approach
- URL: http://arxiv.org/abs/2302.00171v2
- Date: Wed, 1 Nov 2023 17:33:40 GMT
- Title: Active Uncertainty Reduction for Safe and Efficient Interaction
Planning: A Shielding-Aware Dual Control Approach
- Authors: Haimin Hu, David Isele, Sangjae Bae, Jaime F. Fisac
- Abstract summary: We present a novel algorithmic approach to enable active uncertainty reduction for interactive motion planning based on the implicit dual control paradigm.
Our approach relies on sampling-based approximation of dynamic programming, leading to a model predictive control problem that can be readily solved by real-time gradient-based optimization methods.
- Score: 9.07774184840379
- License: http://creativecommons.org/licenses/by-nc-nd/4.0/
- Abstract: The ability to accurately predict others' behavior is central to the safety
and efficiency of interactive robotics. Unfortunately, robots often lack access
to key information on which these predictions may hinge, such as other agents'
goals, attention, and willingness to cooperate. Dual control theory addresses
this challenge by treating unknown parameters of a predictive model as
stochastic hidden states and inferring their values at runtime using
information gathered during system operation. While able to optimally and
automatically trade off exploration and exploitation, dual control is
computationally intractable for general interactive motion planning. In this
paper, we present a novel algorithmic approach to enable active uncertainty
reduction for interactive motion planning based on the implicit dual control
paradigm. Our approach relies on sampling-based approximation of stochastic
dynamic programming, leading to a model predictive control problem that can be
readily solved by real-time gradient-based optimization methods. The resulting
policy is shown to preserve the dual control effect for a broad class of
predictive models with both continuous and categorical uncertainty. To ensure
the safe operation of the interacting agents, we use a runtime safety filter
(also referred to as a "shielding" scheme), which overrides the robot's dual
control policy with a safety fallback strategy when a safety-critical event is
imminent. We then augment the dual control framework with an improved variant
of the recently proposed shielding-aware robust planning scheme, which
proactively balances the nominal planning performance with the risk of
high-cost emergency maneuvers triggered by low-probability agent behaviors. We
demonstrate the efficacy of our approach with both simulated driving studies
and hardware experiments using 1/10 scale autonomous vehicles.
Related papers
- SAFE-SIM: Safety-Critical Closed-Loop Traffic Simulation with Controllable Adversaries [94.84458417662407]
We introduce SAFE-SIM, a novel diffusion-based controllable closed-loop safety-critical simulation framework.
We develop a novel approach to simulate safety-critical scenarios through an adversarial term in the denoising process.
We validate our framework empirically using the NuScenes dataset, demonstrating improvements in both realism and controllability.
arXiv Detail & Related papers (2023-12-31T04:14:43Z) - Model Checking for Closed-Loop Robot Reactive Planning [0.0]
We show how model checking can be used to create multistep plans for a differential drive wheeled robot so that it can avoid immediate danger.
Using a small, purpose built model checking algorithm in situ we generate plans in real-time in a way that reflects the egocentric reactive response of simple biological agents.
arXiv Detail & Related papers (2023-11-16T11:02:29Z) - Actively Learning Reinforcement Learning: A Stochastic Optimal Control
Approach [3.7728340443952577]
We propose a framework towards achieving two intertwined objectives: (i) equipping reinforcement learning with active exploration and deliberate information gathering, such that it regulates state and uncertainties resulting from modeling mismatches and noisy sensory; and (ii) overcoming the huge computational cost of optimal control.
We approach both objectives by using reinforcement learning to attain the optimal control law.
arXiv Detail & Related papers (2023-09-18T18:05:35Z) - Tuning Legged Locomotion Controllers via Safe Bayesian Optimization [47.87675010450171]
This paper presents a data-driven strategy to streamline the deployment of model-based controllers in legged robotic hardware platforms.
We leverage a model-free safe learning algorithm to automate the tuning of control gains, addressing the mismatch between the simplified model used in the control formulation and the real system.
arXiv Detail & Related papers (2023-06-12T13:10:14Z) - Recursively Feasible Probabilistic Safe Online Learning with Control
Barrier Functions [63.18590014127461]
This paper introduces a model-uncertainty-aware reformulation of CBF-based safety-critical controllers.
We study the feasibility of the resulting robust safety-critical controller.
We then use these conditions to devise an event-triggered online data collection strategy.
arXiv Detail & Related papers (2022-08-23T05:02:09Z) - Active Uncertainty Learning for Human-Robot Interaction: An Implicit
Dual Control Approach [5.05828899601167]
We present an algorithmic approach to enable uncertainty learning for human-in-the-loop motion planning based on the implicit dual control paradigm.
Our approach relies on sampling-based approximation of dynamic programming model predictive control problem.
The resulting policy is shown to preserve the dual control effect for generic human predictive models with both continuous and categorical uncertainty.
arXiv Detail & Related papers (2022-02-15T20:40:06Z) - SHARP: Shielding-Aware Robust Planning for Safe and Efficient
Human-Robot Interaction [5.804727815849655]
" Shielding" control scheme overrides the robot's nominal plan with a safety fallback strategy when a safety-critical event is imminent.
We propose a new shielding-based planning approach that allows the robot to plan efficiently by explicitly accounting for possible future shielding events.
arXiv Detail & Related papers (2021-10-02T17:01:59Z) - Deep Structured Reactive Planning [94.92994828905984]
We propose a novel data-driven, reactive planning objective for self-driving vehicles.
We show that our model outperforms a non-reactive variant in successfully completing highly complex maneuvers.
arXiv Detail & Related papers (2021-01-18T01:43:36Z) - Risk-Sensitive Sequential Action Control with Multi-Modal Human
Trajectory Forecasting for Safe Crowd-Robot Interaction [55.569050872780224]
We present an online framework for safe crowd-robot interaction based on risk-sensitive optimal control, wherein the risk is modeled by the entropic risk measure.
Our modular approach decouples the crowd-robot interaction into learning-based prediction and model-based control.
A simulation study and a real-world experiment show that the proposed framework can accomplish safe and efficient navigation while avoiding collisions with more than 50 humans in the scene.
arXiv Detail & Related papers (2020-09-12T02:02:52Z) - The Importance of Prior Knowledge in Precise Multimodal Prediction [71.74884391209955]
Roads have well defined geometries, topologies, and traffic rules.
In this paper we propose to incorporate structured priors as a loss function.
We demonstrate the effectiveness of our approach on real-world self-driving datasets.
arXiv Detail & Related papers (2020-06-04T03:56:11Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.