QED: using Quality-Environment-Diversity to evolve resilient robot
swarms
- URL: http://arxiv.org/abs/2003.02341v1
- Date: Wed, 4 Mar 2020 21:36:07 GMT
- Title: QED: using Quality-Environment-Diversity to evolve resilient robot
swarms
- Authors: David M. Bossens and Danesh Tarapore
- Abstract summary: In swarm robotics, any of the robots in a swarm may be affected by different faults, resulting in significant performance declines.
One model-free approach to fault recovery involves two phases: during simulation, a quality-diversity algorithm evolves a behaviourally diverse archive of controllers.
The impact of environmental diversity is often ignored in the choice of a suitable behavioural descriptor.
- Score: 12.18340575383456
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: In swarm robotics, any of the robots in a swarm may be affected by different
faults, resulting in significant performance declines. To allow fault recovery
from randomly injected faults to different robots in a swarm, a model-free
approach may be preferable due to the accumulation of faults in models and the
difficulty to predict the behaviour of neighbouring robots. One model-free
approach to fault recovery involves two phases: during simulation, a
quality-diversity algorithm evolves a behaviourally diverse archive of
controllers; during the target application, a search for the best controller is
initiated after fault injection. In quality-diversity algorithms, the choice of
the behavioural descriptor is a key design choice that determines the quality
of the evolved archives, and therefore the fault recovery performance. Although
the environment is an important determinant of behaviour, the impact of
environmental diversity is often ignored in the choice of a suitable
behavioural descriptor. This study compares different behavioural descriptors,
including two generic descriptors that work on a wide range of tasks, one
hand-coded descriptor which fits the domain of interest, and one novel type of
descriptor based on environmental diversity, which we call
Quality-Environment-Diversity (QED). Results demonstrate that the
above-mentioned model-free approach to fault recovery is feasible in the
context of swarm robotics, reducing the fault impact by a factor 2-3. Further,
the environmental diversity obtained with QED yields a unique behavioural
diversity profile that allows it to recover from high-impact faults.
Related papers
- Robustness for Free: Quality-Diversity Driven Discovery of Agile Soft
Robotic Gaits [0.7829600874436199]
We show how Quality Diversity Algorithms can produce repertoires of gaits robust to changing terrains.
This robustness significantly out-performs that of gaits produced by a single objective optimization algorithm.
arXiv Detail & Related papers (2023-11-02T14:00:11Z) - Quality Diversity under Sparse Reward and Sparse Interaction:
Application to Grasping in Robotics [0.0]
Quality-Diversity (QD) methods are algorithms that aim to generate a set of diverse and high-performing solutions to a given problem.
The present work studies how QD can address grasping in robotics.
Experiments have been conducted on 15 different methods on 10 grasping domains, corresponding to 2 different robot-gripper setups and 5 standard objects.
arXiv Detail & Related papers (2023-08-10T10:19:48Z) - Causal Disentanglement Hidden Markov Model for Fault Diagnosis [55.90917958154425]
We propose a Causal Disentanglement Hidden Markov model (CDHM) to learn the causality in the bearing fault mechanism.
Specifically, we make full use of the time-series data and progressively disentangle the vibration signal into fault-relevant and fault-irrelevant factors.
To expand the scope of the application, we adopt unsupervised domain adaptation to transfer the learned disentangled representations to other working environments.
arXiv Detail & Related papers (2023-08-06T05:58:45Z) - Distributional Instance Segmentation: Modeling Uncertainty and High
Confidence Predictions with Latent-MaskRCNN [77.0623472106488]
In this paper, we explore a class of distributional instance segmentation models using latent codes.
For robotic picking applications, we propose a confidence mask method to achieve the high precision necessary.
We show that our method can significantly reduce critical errors in robotic systems, including our newly released dataset of ambiguous scenes.
arXiv Detail & Related papers (2023-05-03T05:57:29Z) - The Quality-Diversity Transformer: Generating Behavior-Conditioned
Trajectories with Decision Transformers [3.185440619417202]
Quality-Diversity algorithms have proven effective in generating repertoires of diverse and efficient policies.
In uncertain environments, policies can lack robustness and repeatability.
We present a new approach to achieve behavior-conditioned trajectory generation based on two mechanisms.
arXiv Detail & Related papers (2023-03-27T19:39:49Z) - Estimation of continuous environments by robot swarms: Correlated
networks and decision-making [0.0]
Large-scale multi-robot systems need collective decision-making to establish autonomy on the swarm level.
We propose a control algorithm and study it in real-world robot swarm experiments in different environments.
We show that our approach is effective and achieves higher precision than a control experiment.
arXiv Detail & Related papers (2023-02-27T09:57:15Z) - Learning to Walk Autonomously via Reset-Free Quality-Diversity [73.08073762433376]
Quality-Diversity algorithms can discover large and complex behavioural repertoires consisting of both diverse and high-performing skills.
Existing QD algorithms need large numbers of evaluations as well as episodic resets, which require manual human supervision and interventions.
This paper proposes Reset-Free Quality-Diversity optimization (RF-QD) as a step towards autonomous learning for robotics in open-ended environments.
arXiv Detail & Related papers (2022-04-07T14:07:51Z) - Rapidly adapting robot swarms with Swarm Map-based Bayesian Optimisation [1.0152838128195467]
Rapid performance recovery from unforeseen environmental perturbations remains a grand challenge in swarm robotics.
To solve this challenge, we investigate a behaviour adaptation approach, where one searches an archive of controllers for potential recovery solutions.
We propose two algorithms: (i) Swarm Map-based optimisation (SMBO), which selects and evaluates one controller at a time, for a homogeneous swarm, in a centralised fashion; and (ii) Swarm Map-based optimisation Decentralised (SMBO-Dec), which performs an asynchronous batch-based Bayesian optimisation.
arXiv Detail & Related papers (2020-12-21T15:54:37Z) - Attribute-Guided Adversarial Training for Robustness to Natural
Perturbations [64.35805267250682]
We propose an adversarial training approach which learns to generate new samples so as to maximize exposure of the classifier to the attributes-space.
Our approach enables deep neural networks to be robust against a wide range of naturally occurring perturbations.
arXiv Detail & Related papers (2020-12-03T10:17:30Z) - Risk-Sensitive Sequential Action Control with Multi-Modal Human
Trajectory Forecasting for Safe Crowd-Robot Interaction [55.569050872780224]
We present an online framework for safe crowd-robot interaction based on risk-sensitive optimal control, wherein the risk is modeled by the entropic risk measure.
Our modular approach decouples the crowd-robot interaction into learning-based prediction and model-based control.
A simulation study and a real-world experiment show that the proposed framework can accomplish safe and efficient navigation while avoiding collisions with more than 50 humans in the scene.
arXiv Detail & Related papers (2020-09-12T02:02:52Z) - SUOD: Accelerating Large-Scale Unsupervised Heterogeneous Outlier
Detection [63.253850875265115]
Outlier detection (OD) is a key machine learning (ML) task for identifying abnormal objects from general samples.
We propose a modular acceleration system, called SUOD, to address it.
arXiv Detail & Related papers (2020-03-11T00:22:50Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.