Rapidly adapting robot swarms with Swarm Map-based Bayesian Optimisation
- URL: http://arxiv.org/abs/2012.11444v1
- Date: Mon, 21 Dec 2020 15:54:37 GMT
- Title: Rapidly adapting robot swarms with Swarm Map-based Bayesian Optimisation
- Authors: David M. Bossens and Danesh Tarapore
- Abstract summary: Rapid performance recovery from unforeseen environmental perturbations remains a grand challenge in swarm robotics.
To solve this challenge, we investigate a behaviour adaptation approach, where one searches an archive of controllers for potential recovery solutions.
We propose two algorithms: (i) Swarm Map-based optimisation (SMBO), which selects and evaluates one controller at a time, for a homogeneous swarm, in a centralised fashion; and (ii) Swarm Map-based optimisation Decentralised (SMBO-Dec), which performs an asynchronous batch-based Bayesian optimisation.
- Score: 1.0152838128195467
- License: http://arxiv.org/licenses/nonexclusive-distrib/1.0/
- Abstract: Rapid performance recovery from unforeseen environmental perturbations
remains a grand challenge in swarm robotics. To solve this challenge, we
investigate a behaviour adaptation approach, where one searches an archive of
controllers for potential recovery solutions. To apply behaviour adaptation in
swarm robotic systems, we propose two algorithms: (i) Swarm Map-based
Optimisation (SMBO), which selects and evaluates one controller at a time, for
a homogeneous swarm, in a centralised fashion; and (ii) Swarm Map-based
Optimisation Decentralised (SMBO-Dec), which performs an asynchronous
batch-based Bayesian optimisation to simultaneously explore different
controllers for groups of robots in the swarm. We set up foraging experiments
with a variety of disturbances: injected faults to proximity sensors, ground
sensors, and the actuators of individual robots, with 100 unique combinations
for each type. We also investigate disturbances in the operating environment of
the swarm, where the swarm has to adapt to drastic changes in the number of
resources available in the environment, and to one of the robots behaving
disruptively towards the rest of the swarm, with 30 unique conditions for each
such perturbation. The viability of SMBO and SMBO-Dec is demonstrated,
comparing favourably to variants of random search and gradient descent, and
various ablations, and improving performance up to 80% compared to the
performance at the time of fault injection within at most 30 evaluations.
Related papers
- Arbitrarily Scalable Environment Generators via Neural Cellular Automata [55.150593161240444]
We show that NCA environment generators maintain consistent, regularized patterns regardless of environment size.
Our method scales a single-agent reinforcement learning policy to arbitrarily large environments with similar patterns.
arXiv Detail & Related papers (2023-10-28T07:30:09Z) - Collective motion emerging from evolving swarm controllers in different
environments using gradient following task [2.7402733069181]
We consider a challenging task where robots with limited sensing and communication abilities must follow the gradient of an environmental feature.
We use Differential Evolution to evolve a neural network controller for simulated Thymio II robots.
Experiments confirm the feasibility of our approach, the evolved robot controllers induced swarm behaviour that solved the task.
arXiv Detail & Related papers (2022-03-22T10:08:50Z) - Centralizing State-Values in Dueling Networks for Multi-Robot
Reinforcement Learning Mapless Navigation [87.85646257351212]
We study the problem of multi-robot mapless navigation in the popular Training and Decentralized Execution (CTDE) paradigm.
This problem is challenging when each robot considers its path without explicitly sharing observations with other robots.
We propose a novel architecture for CTDE that uses a centralized state-value network to compute a joint state-value.
arXiv Detail & Related papers (2021-12-16T16:47:00Z) - SABER: Data-Driven Motion Planner for Autonomously Navigating
Heterogeneous Robots [112.2491765424719]
We present an end-to-end online motion planning framework that uses a data-driven approach to navigate a heterogeneous robot team towards a global goal.
We use model predictive control (SMPC) to calculate control inputs that satisfy robot dynamics, and consider uncertainty during obstacle avoidance with chance constraints.
recurrent neural networks are used to provide a quick estimate of future state uncertainty considered in the SMPC finite-time horizon solution.
A Deep Q-learning agent is employed to serve as a high-level path planner, providing the SMPC with target positions that move the robots towards a desired global goal.
arXiv Detail & Related papers (2021-08-03T02:56:21Z) - Robotic Brain Storm Optimization: A Multi-target Collaborative Searching
Paradigm for Swarm Robotics [24.38312890501329]
This paper proposes a BSO-based collaborative searching framework for swarm robotics called Robotic BSO.
The proposed method can simulate the BSO's guided search characteristics and has an excellent prospect for multi-target searching problems for swarm robotics.
arXiv Detail & Related papers (2021-05-27T13:05:48Z) - Online Body Schema Adaptation through Cost-Sensitive Active Learning [63.84207660737483]
The work was implemented in a simulation environment, using the 7DoF arm of the iCub robot simulator.
A cost-sensitive active learning approach is used to select optimal joint configurations.
The results show cost-sensitive active learning has similar accuracy to the standard active learning approach, while reducing in about half the executed movement.
arXiv Detail & Related papers (2021-01-26T16:01:02Z) - Neuromorphic adaptive spiking CPG towards bio-inspired locomotion of
legged robots [58.720142291102135]
Spiking Central Pattern Generator generates different locomotion patterns driven by an external stimulus.
The locomotion of the end robotic platform (any-legged robot) can be adapted to the terrain by using any sensor as input.
arXiv Detail & Related papers (2021-01-24T12:44:38Z) - Distributed Multi-agent Meta Learning for Trajectory Design in Wireless
Drone Networks [151.27147513363502]
This paper studies the problem of the trajectory design for a group of energyconstrained drones operating in dynamic wireless network environments.
A value based reinforcement learning (VDRL) solution and a metatraining mechanism is proposed.
arXiv Detail & Related papers (2020-12-06T01:30:12Z) - Bioinspired Bipedal Locomotion Control for Humanoid Robotics Based on
EACO [1.0152838128195467]
This work presents promoting global search capability and convergence rate of the EACO applied to humanoid robots in real-time.
We put a special focus on the EACO algorithm on a wide range of problems, from ACO, real-coded GAs, GAs with neural networks(NNs), particle swarm optimization(PSO) to complex robotics systems.
arXiv Detail & Related papers (2020-10-09T09:43:48Z) - AMPSO: Artificial Multi-Swarm Particle Swarm Optimization [5.7415897900373425]
We propose a novel artificial multi-swarm PSO which consists of an exploration swarm, an artificial exploitation swarm and an convergence swarm.
To guarantee the accuracy of the results, a novel diversity scheme is proposed to control the exploration, exploitation and convergence processes of the swarms.
The effectiveness of AMPSO is validated on all the functions in the CEC2015 test suite.
arXiv Detail & Related papers (2020-04-16T09:59:45Z) - QED: using Quality-Environment-Diversity to evolve resilient robot
swarms [12.18340575383456]
In swarm robotics, any of the robots in a swarm may be affected by different faults, resulting in significant performance declines.
One model-free approach to fault recovery involves two phases: during simulation, a quality-diversity algorithm evolves a behaviourally diverse archive of controllers.
The impact of environmental diversity is often ignored in the choice of a suitable behavioural descriptor.
arXiv Detail & Related papers (2020-03-04T21:36:07Z)
This list is automatically generated from the titles and abstracts of the papers in this site.
This site does not guarantee the quality of this site (including all information) and is not responsible for any consequences.