Safe multi-agent motion planning under uncertainty for drones using
filtered reinforcement learning
- URL: http://arxiv.org/abs/2311.00063v1
- Date: Tue, 31 Oct 2023 18:09:26 GMT
- Title: Safe multi-agent motion planning under uncertainty for drones using
filtered reinforcement learning
- Authors: Sleiman Safaoui, Abraham P. Vinod, Ankush Chakrabarty, Rien Quirynen,
Nobuyuki Yoshikawa and Stefano Di Cairano
- Abstract summary: We present a tractable motion planner that builds upon the strengths of reinforcement learning and constrained-control-based trajectory planning.
The proposed approach yields a safe, real-time implementable, multi-agent motion planner that is simpler to train than methods based solely on learning.
- Score: 6.783774261623415
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: We consider the problem of safe multi-agent motion planning for drones in
uncertain, cluttered workspaces. For this problem, we present a tractable
motion planner that builds upon the strengths of reinforcement learning and
constrained-control-based trajectory planning. First, we use single-agent
reinforcement learning to learn motion plans from data that reach the target
but may not be collision-free. Next, we use a convex optimization, chance
constraints, and set-based methods for constrained control to ensure safety,
despite the uncertainty in the workspace, agent motion, and sensing. The
proposed approach can handle state and control constraints on the agents, and
enforce collision avoidance among themselves and with static obstacles in the
workspace with high probability. The proposed approach yields a safe, real-time
implementable, multi-agent motion planner that is simpler to train than methods
based solely on learning. Numerical simulations and experiments show the
efficacy of the approach.
Related papers
- RACER: Epistemic Risk-Sensitive RL Enables Fast Driving with Fewer Crashes [57.319845580050924]
We propose a reinforcement learning framework that combines risk-sensitive control with an adaptive action space curriculum.
We show that our algorithm is capable of learning high-speed policies for a real-world off-road driving task.
arXiv Detail & Related papers (2024-05-07T23:32:36Z) - SAFE-SIM: Safety-Critical Closed-Loop Traffic Simulation with Diffusion-Controllable Adversaries [94.84458417662407]
We introduce SAFE-SIM, a controllable closed-loop safety-critical simulation framework.
Our approach yields two distinct advantages: 1) generating realistic long-tail safety-critical scenarios that closely reflect real-world conditions, and 2) providing controllable adversarial behavior for more comprehensive and interactive evaluations.
We validate our framework empirically using the nuScenes and nuPlan datasets across multiple planners, demonstrating improvements in both realism and controllability.
arXiv Detail & Related papers (2023-12-31T04:14:43Z) - A Safer Vision-based Autonomous Planning System for Quadrotor UAVs with
Dynamic Obstacle Trajectory Prediction and Its Application with LLMs [6.747468447244154]
This paper proposes a vision-based planning system that combines tracking and trajectory prediction of dynamic obstacles to achieve efficient and reliable autonomous flight.
We conduct experiments in both simulation and real-world environments, and the results indicate that our approach can successfully detect and avoid obstacles in dynamic environments in real-time.
arXiv Detail & Related papers (2023-11-21T08:09:00Z) - Active Uncertainty Reduction for Safe and Efficient Interaction
Planning: A Shielding-Aware Dual Control Approach [9.07774184840379]
We present a novel algorithmic approach to enable active uncertainty reduction for interactive motion planning based on the implicit dual control paradigm.
Our approach relies on sampling-based approximation of dynamic programming, leading to a model predictive control problem that can be readily solved by real-time gradient-based optimization methods.
arXiv Detail & Related papers (2023-02-01T01:34:48Z) - Safety Correction from Baseline: Towards the Risk-aware Policy in
Robotics via Dual-agent Reinforcement Learning [64.11013095004786]
We propose a dual-agent safe reinforcement learning strategy consisting of a baseline and a safe agent.
Such a decoupled framework enables high flexibility, data efficiency and risk-awareness for RL-based control.
The proposed method outperforms the state-of-the-art safe RL algorithms on difficult robot locomotion and manipulation tasks.
arXiv Detail & Related papers (2022-12-14T03:11:25Z) - Obstacle Avoidance for UAS in Continuous Action Space Using Deep
Reinforcement Learning [9.891207216312937]
Obstacle avoidance for small unmanned aircraft is vital for the safety of future urban air mobility.
We propose a deep reinforcement learning algorithm based on Proximal Policy Optimization (PPO) to guide autonomous UAS to their destinations.
Results show that the proposed model can provide accurate and robust guidance and resolve conflict with a success rate of over 99%.
arXiv Detail & Related papers (2021-11-13T04:44:53Z) - SHARP: Shielding-Aware Robust Planning for Safe and Efficient
Human-Robot Interaction [5.804727815849655]
" Shielding" control scheme overrides the robot's nominal plan with a safety fallback strategy when a safety-critical event is imminent.
We propose a new shielding-based planning approach that allows the robot to plan efficiently by explicitly accounting for possible future shielding events.
arXiv Detail & Related papers (2021-10-02T17:01:59Z) - Safe Reinforcement Learning via Curriculum Induction [94.67835258431202]
In safety-critical applications, autonomous agents may need to learn in an environment where mistakes can be very costly.
Existing safe reinforcement learning methods make an agent rely on priors that let it avoid dangerous situations.
This paper presents an alternative approach inspired by human teaching, where an agent learns under the supervision of an automatic instructor.
arXiv Detail & Related papers (2020-06-22T10:48:17Z) - The Importance of Prior Knowledge in Precise Multimodal Prediction [71.74884391209955]
Roads have well defined geometries, topologies, and traffic rules.
In this paper we propose to incorporate structured priors as a loss function.
We demonstrate the effectiveness of our approach on real-world self-driving datasets.
arXiv Detail & Related papers (2020-06-04T03:56:11Z) - Enhancing Lattice-based Motion Planning with Introspective Learning and
Reasoning [3.2689702143620143]
This work is concerned with introspective learning and reasoning about controller performance over time.
Normal controller execution of the different actions is learned using reliable and uncertainty-aware machine learning techniques.
Reasoning takes place to both verify that the learned models stays safe and to improve collision checking effectiveness in the motion planner.
arXiv Detail & Related papers (2020-05-15T07:16:51Z) - Chance-Constrained Trajectory Optimization for Safe Exploration and
Learning of Nonlinear Systems [81.7983463275447]
Learning-based control algorithms require data collection with abundant supervision for training.
We present a new approach for optimal motion planning with safe exploration that integrates chance-constrained optimal control with dynamics learning and feedback control.
arXiv Detail & Related papers (2020-05-09T05:57:43Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.