Decentralized Swarm Collision Avoidance for Quadrotors via End-to-End
Reinforcement Learning
- URL: http://arxiv.org/abs/2104.14912v1
- Date: Fri, 30 Apr 2021 11:19:03 GMT
- Title: Decentralized Swarm Collision Avoidance for Quadrotors via End-to-End
Reinforcement Learning
- Authors: Ramzi Ourari, Kai Cui, Heinz Koeppl
- Abstract summary: We draw biological inspiration from flocks of starlings and apply the insight to end-to-end learned decentralized collision avoidance.
We propose a new, scalable observation model following a biomimetic topological interaction rule.
Our learned policies are tested in simulation and subsequently transferred to real-world drones to validate their real-world applicability.
- Score: 28.592704336574158
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Collision avoidance algorithms are of central interest to many drone
applications. In particular, decentralized approaches may be the key to
enabling robust drone swarm solutions in cases where centralized communication
becomes computationally prohibitive. In this work, we draw biological
inspiration from flocks of starlings (Sturnus vulgaris) and apply the insight
to end-to-end learned decentralized collision avoidance. More specifically, we
propose a new, scalable observation model following a biomimetic topological
interaction rule that leads to stable learning and robust avoidance behavior.
Additionally, prior work primarily focuses on invoking a separation principle,
i.e. designing collision avoidance independent of specific tasks. By applying a
general reinforcement learning approach, we propose a holistic learning-based
approach to integrating collision avoidance with various tasks and dynamics. To
validate the generality of this approach, we successfully apply our methodology
to a number of configurations. Our learned policies are tested in simulation
and subsequently transferred to real-world drones to validate their real-world
applicability.
Related papers
- Generalizability of Graph Neural Networks for Decentralized Unlabeled Motion Planning [72.86540018081531]
Unlabeled motion planning involves assigning a set of robots to target locations while ensuring collision avoidance.
This problem forms an essential building block for multi-robot systems in applications such as exploration, surveillance, and transportation.
We address this problem in a decentralized setting where each robot knows only the positions of its $k$-nearest robots and $k$-nearest targets.
arXiv Detail & Related papers (2024-09-29T23:57:25Z) - Towards Universal Dense Blocking for Entity Resolution [49.06313308481536]
We propose UniBlocker, a dense blocker that is pre-trained on a domain-independent, easily-obtainable corpus.
By conducting domain-independent pre-training, UniBlocker can be adapted to various downstream blocking scenarios without requiring domain-specific fine-tuning.
Our experiments show that the proposed UniBlocker, without any domain-specific learning, significantly outperforms previous self- and unsupervised dense blocking methods.
arXiv Detail & Related papers (2024-04-23T08:39:29Z) - Multi-granular Adversarial Attacks against Black-box Neural Ranking Models [111.58315434849047]
We create high-quality adversarial examples by incorporating multi-granular perturbations.
We transform the multi-granular attack into a sequential decision-making process.
Our attack method surpasses prevailing baselines in both attack effectiveness and imperceptibility.
arXiv Detail & Related papers (2024-04-02T02:08:29Z) - Denoising Heat-inspired Diffusion with Insulators for Collision Free
Motion Planning [3.074694788117593]
Diffusion models have risen as a powerful tool in robotics due to their flexibility and multi-modality.
We present a method that simultaneously generates only reachable goals and plans motions that avoid obstacles.
arXiv Detail & Related papers (2023-10-19T09:39:07Z) - Sequential Neural Barriers for Scalable Dynamic Obstacle Avoidance [7.375976854181687]
We propose a novel method for compositional learning of Sequential Neural Control Barrier models (SNCBFs)
Our approach exploits an important observation: the spatial interaction patterns of multiple dynamic obstacles can be decomposed and predicted through temporal sequences of states for each obstacle.
We demonstrate the benefits of the proposed methods in improving dynamic collision avoidance in comparison with existing methods.
arXiv Detail & Related papers (2023-07-06T14:24:17Z) - Safe Multi-agent Learning via Trapping Regions [89.24858306636816]
We apply the concept of trapping regions, known from qualitative theory of dynamical systems, to create safety sets in the joint strategy space for decentralized learning.
We propose a binary partitioning algorithm for verification that candidate sets form trapping regions in systems with known learning dynamics, and a sampling algorithm for scenarios where learning dynamics are not known.
arXiv Detail & Related papers (2023-02-27T14:47:52Z) - Scalable Task-Driven Robotic Swarm Control via Collision Avoidance and
Learning Mean-Field Control [23.494528616672024]
We use state-of-the-art mean-field control techniques to convert many-agent swarm control into classical single-agent control of distributions.
Here, we combine collision avoidance and learning of mean-field control into a unified framework for tractably designing intelligent robotic swarm behavior.
arXiv Detail & Related papers (2022-09-15T16:15:04Z) - Object-Aware Regularization for Addressing Causal Confusion in Imitation
Learning [131.1852444489217]
This paper presents Object-aware REgularizatiOn (OREO), a technique that regularizes an imitation policy in an object-aware manner.
Our main idea is to encourage a policy to uniformly attend to all semantic objects, in order to prevent the policy from exploiting nuisance variables strongly correlated with expert actions.
arXiv Detail & Related papers (2021-10-27T01:56:23Z) - Congestion-aware Multi-agent Trajectory Prediction for Collision
Avoidance [110.63037190641414]
We propose to learn congestion patterns explicitly and devise a novel "Sense--Learn--Reason--Predict" framework.
By decomposing the learning phases into two stages, a "student" can learn contextual cues from a "teacher" while generating collision-free trajectories.
In experiments, we demonstrate that the proposed model is able to generate collision-free trajectory predictions in a synthetic dataset.
arXiv Detail & Related papers (2021-03-26T02:42:33Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.